(54) Title: USE OF AMINO ACID SEQUENCES INVOLVED IN THE ELONGATION OF FATTY ACIDS IN IDENTIFYING
AND/OR DEVELOPING COMPOUNDS FOR PREVENTING AND/OR TREATING METABOLIC DISEASES 

(57) Abstract: The present invention relates to methods for the identification and/or the development of compounds that can be 
used to prevent and/or to treat metabolic diseases, and to the use of amino acid sequences that are involved in the elongation of fatty 
acids in such methods. The invention also relates to compounds that have been identified and/or developed using said methods, to 
pharmaceutical compositions that contain such compounds, and to the use of said compounds in the preparation of pharmaceutical 
compositions, in particular of pharmaceutical compositions for the prevention and/or treatment of metabolic diseases. The invention
further relates to compounds that can interact with amino acid sequences that are involved in the elongation of fatty acids, to
pharmaceutical compositions that contain such compounds, and to the use of said compounds in the preparation of pharmaceutical 
compositions, in particular of pharmaceutical compositions for the prevention and/or treatment of metabolic diseases.
Use of amino acid sequences involved in the elongation of fatty acids in identifying
and/or developing compounds for preventing and/or treating metabolic diseases. 

The present invention relates to methods for the identification and/or the 
development of compounds that can be used to prevent and/or to treat metabolic diseases,
and to the use of amino acid sequences that are involved in the elongation of fatty acids 
in such methods. 

The invention also relates to compounds that have been identified and/or 
developed using said methods, to pharmaceutical compositions that contain such 
compounds, and to the use of said compounds in the preparation of pharmaceutical
compositions, in particular of pharmaceutical compositions for the prevention and/or 
treatment of metabolic diseases. 

The invention further relates to compounds that can interact with amino acid
sequences that are involved in the elongation of fatty acids, to pharmaceutical
compositions that contain such compounds, and to the use of said compounds in the
preparation of pharmaceutical compositions, in particular of pharmaceutical compositions
for the prevention and/or treatment of metabolic diseases.

Other aspects, embodiments, applications and advantages of the present invention
will become clear from the further description below.

The present invention was established on the basis of the finding that amino acid
sequences that are involved in the elongation of fatty acids (as further described below)
can be used as (potential) "target(s)" for in vitro and/or in vivo interaction with chemical
compounds and other factors (with the term "target" having its usual meaning in the art,
vide for example the definition given in WO 98/06737). Consequently, compounds
and/or factors that have been identified as interacting with amino acid sequences that are
involved in the elongation of fatty acids (e.g. using the methods described herein) may be
useful as active agents in the pharmaceutical field, and in particular for the prevention
and/or treatment of metabolic diseases.

Pathways for the de novo biosynthesis of fatty acids, as well as the amino acid
sequences/enzymes that are involved in such pathways, have been described in the art.
See for example:
- Leonard et al., Biochem. J. (2000), 350, p. 765-770, for the elongation pathway in
Saccharomyces cerevisiae;
- Tvrdik et al., The Journal of Cell Biology, Vol. 149 (2000), p. 707-717; Tvrdik et al., J.
Biol. Chem., Vol. 272, No. 50, p. 31738-31746 (1997); Moon et al., J. Biol. Chem.,
Vol. 276, No. 48, p. 45358-45366 (2001); Matsuzaka et al., J. Lipid Res., Vol. 43
(2002), p. 911-920; and Moon and Horton, J. Biol. Chem., Vol. 278, No. 9, p. 7335-7343
(2003), for the elongation pathway in mice and humans.

In animals, fatty acids are generally synthesized de novo from acetyl-CoA and
malonyl-CoA through a series of reactions mediated by acetyl-CoA carboxylase (also
referred to as "ACC") and fatty acid synthase (also referred to as "FAS"). Reference is
inter alia made to Moon et al., supra.

In mammals, this de novo synthesis principally results in, and ends with (i.e. for
about 90%, with about 10% of higher fatty acids such as stearic acid being formed), the
fatty acid palmitic acid (which, in accordance with standard fatty acid nomenclature, is
often designated as "16:0", in which "16" denotes the number of carbon atoms in the
fatty acid carbon chain - i.e. C16 - and "0" denotes the number of unsaturated bonds, i.e.
0), see again Moon et al., supra. This is released into the cytosol, from where it is either
further elongated - i.e. to stearic acid (18:0, which is the main saturated product resulting
from the elongation reaction) and/or is desaturated (e.g. from 16:0 to 16:1 for palmitic
acid, or from 18:0 to 18:1 for stearic acid, etc.).

The elongation of fatty acids is performed in the mitochondria and more
commonly in the endoplasmic reticulum by a complex of multiple enzymes, and
involves a cycle of condensation (referred to herein as the "First Step"), reduction
(referred to herein as the "Second Step"), dehydration (referred to herein as the "Third
Step") and reduction (referred to herein as the "Fourth Step"), which are as follows (see
Moon and Horton, supra):

First Step: condensation between a fatty acyl-CoA and malonyl-CoA to form 3-
ketoacyl-CoA;



WO 2005/030985 PCT/GB2004/004103



Second Step: reduction of the 3-ketoacyl-CoA using NADPH to form 3-hydroxyacyl- 
CoA; 

Third Step: dehydration of 3-hydroxyacyl-CoA to trans-2,3-enoyl-CoA;

Fourth Step: reduction of trans-2,3-enoyl-CoA to saturated acyl-CoA.

This cycle results in the addition of two carbon atoms to the fatty acid chain,
compared to the starting fatty acid. The fatty acid chain thus elongated may then either be
further elongated through another cycle of condensation, reduction, dehydration and
reduction; and/or may be desaturated. In turn, the further elongated and/or desaturated
fatty acids thus obtained may then again be further elongated and/or desaturated. In this
way, starting from palmitic acid as the main product from the de novo synthesis by the
FAS/ACC system, a whole range of saturated and unsaturated fatty acids may be
synthesized.
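
By way of illustration only, the bookkeeping of the cycle described above can be sketched in Python. The sketch models a fatty acid by its "carbons:double bonds" designation and applies one elongation cycle (two carbon atoms added) or one desaturation. The class and method names are hypothetical and do not come from the cited references; the sketch tracks chain length and unsaturation only, not the CoA intermediates.

```python
from dataclasses import dataclass

# The four steps of one elongation cycle, in the order given above.
ELONGATION_CYCLE = ("condensation", "reduction", "dehydration", "reduction")


@dataclass(frozen=True)
class FattyAcid:
    """Fatty acid in 'carbons:double bonds' shorthand, e.g. 16:0 (hypothetical helper)."""
    carbons: int
    double_bonds: int = 0

    def elongate(self) -> "FattyAcid":
        # One full cycle of the four steps adds two carbon atoms to the chain.
        return FattyAcid(self.carbons + 2, self.double_bonds)

    def desaturate(self) -> "FattyAcid":
        # One desaturation introduces one additional double bond.
        return FattyAcid(self.carbons, self.double_bonds + 1)

    def __str__(self) -> str:
        return f"{self.carbons}:{self.double_bonds}"


palmitic = FattyAcid(16, 0)
stearic = palmitic.elongate()
print(palmitic, "->", stearic)               # 16:0 -> 18:0
print(stearic, "->", stearic.desaturate())   # 18:0 -> 18:1
```

Repeated calls to `elongate()` and `desaturate()` reproduce the statement that a whole range of saturated and unsaturated fatty acids can be reached from palmitic acid.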

However, some families of biologically important fatty acids cannot be
synthesized by animals, because they lack some of the required enzymes. For instance,
when it comes to the desaturation of saturated fatty acids and/or the further desaturation
of unsaturated fatty acids, animals lack the desaturases required for introducing a double
bond beyond the 9th position in the carbon chain, and can only insert additional double
bonds between an existing double bond and the terminal carboxyl group.

Thus, for instance, animals cannot synthesize (the precursors for) the fatty acids
of the so-called "(n-6) series" (in which, in accordance with standard biochemical
nomenclature, "n" denotes the number of carbon atoms and "6" denotes the position of
the last double bond, counting from the methyl group determining the metabolic family)
and the "(n-3) series", which include for example linoleic acid (18:2(n-6)), the primary
precursor for arachidonic acid (20:4(n-6)) and other fatty acids of the (n-6) series, and
alpha-linolenic acid (18:3(n-3)), the main precursor for fatty acids of the (n-3) series.
Animals must therefore take up these essential fatty acids from plant sources as part of
their diet (mainly in the form of linoleic acid and alpha-linolenic acid); the fatty acids
thus taken up may then be elongated and/or further desaturated to provide other fatty
acids of the (n-6) and (n-3) series, again via the steps outlined above.
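
The nomenclature used above can be made concrete with a small Python sketch that parses designations such as 18:2(n-6) and converts the methyl-end position of the last double bond into a position counted from the carboxyl end (chain length minus the family number). The function and type names are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple, Optional


class Designation(NamedTuple):
    carbons: int
    double_bonds: int
    family: Optional[int]  # the "x" in (n-x); None if not stated


_PATTERN = re.compile(r"^\s*(\d+):(\d+)\s*(?:\(n-(\d+)\))?\s*$")


def parse(designation: str) -> Designation:
    """Parse shorthand such as '18:2(n-6)' or '16:0' (hypothetical helper)."""
    match = _PATTERN.match(designation)
    if match is None:
        raise ValueError(f"unrecognised designation: {designation!r}")
    carbons, bonds, family = match.groups()
    return Designation(int(carbons), int(bonds), int(family) if family else None)


def last_double_bond_from_carboxyl(d: Designation) -> Optional[int]:
    """Carbon number, counted from the carboxyl end, where the last double bond starts."""
    if d.family is None or d.double_bonds == 0:
        return None
    return d.carbons - d.family


linoleic = parse("18:2(n-6)")
alpha_linolenic = parse("18:3(n-3)")
print(last_double_bond_from_carboxyl(linoleic))         # 12
print(last_double_bond_from_carboxyl(alpha_linolenic))  # 15
```

Both values lie beyond the 9th position, which is consistent with the statement that animals cannot introduce these double bonds themselves.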

The cycle of elongation steps described above is performed by a system of
multiple enzymes comprising:

First Step: a condensing enzyme;
Second Step: a β (beta)-oxoacyl-CoA reductase;
Third Step: a β (beta)-hydroxyacyl-CoA dehydrase;
Fourth Step: a trans-2-enoyl-CoA reductase;

which enzymes are also collectively referred to as "elongation enzymes" or "elongases"
(see for example Moon et al., supra), and sometimes also as the "fatty acid chain
elongation system" or "FACES" (see for example Leonard et al., supra).

It is generally assumed that the condensation (the First Step above) is catalysed by
different members from a group of essentially similar but distinct condensation enzymes,
each of which has substrate specificity for a fatty acid (or for a series of fatty acids) with
a specific number (or range) of carbon atoms in the fatty acid chain. For example, in S.
cerevisiae, it has been shown that there is one condensation enzyme - called "ELO1" -
which has specificity for the elongation of C14:0 fatty acids into C16:0 fatty acids
(compared to for instance the elongation of C16:0 into C18:0, see Toke et al., J. Biol.
Chem., Vol. 271, No. 31, p. 18413-18422 (1996)) and that there are two other
condensation enzymes - called "ELO2"/"FEN1" and "ELO3"/"SUR4", respectively -
which have specificity for the condensation/elongation of fatty acids up to C24 and for the
condensation/elongation of C24 fatty acids into C26 fatty acids, respectively (see Oh et al.,
J. Biol. Chem., Vol. 272, No. 28, p. 17376-17384 (1997)).

The functional equivalents of ELO2 and ELO3 from mice have been described as
"Cig30" and "Ssc1", respectively (see Tvrdik et al. (2000), supra). Also, a condensation
enzyme from mice with specificity for the conversion of C16 to C18 fatty acids - called
"LCE" - and the sequence of its human ortholog have been identified (see Moon et al.,
supra, in particular the alignment in Figure 2 on page 45361).

The art also describes several assays for determining the activity and/or
(substrate) specificity of elongases with respect to different fatty acids, for example the
"in vitro fatty acid elongation assay" described in Moon et al. (supra), the "fatty acid
elongation assay" described by Tvrdik et al. (2000, supra) and/or the "fatty acid
elongation assay" described by Matsuzaka et al. (supra). As indicated therein and as
further described below, such assays may also be performed with for example
microsomal preparations, which may contain two or more or essentially all enzymes
involved in the First Step to Fourth Step above, so as to measure the activity of the entire
pathway, or the influence of a single enzyme on said pathway.

The (proposed) biological function of mammalian elongation enzymes can also be
determined/confirmed by (over)expression of cDNA encoding said enzymes in S.
cerevisiae (as performed for the human elongase HELO1 (= ELOVL5) by Leonard et al.,
supra) or in a mutant of S. cerevisiae that lacks one or more of the native elongation
enzymes ELO1, ELO2 and/or ELO3 (see Toke et al. and Oh et al., both supra).

The nucleotide sequences and proposed amino acid sequences for the human
condensation enzymes involved in the First Step above - which are also collectively
referred to as the "ELOVL" family - have also been described in the art, i.e. as follows:
- ELOVL6, which is the human ortholog of the mouse elongase LCE, which has
specificity for the conversion of Cu and C16 saturated and mono-unsaturated fatty
acids, such as for the conversion of C16 to C18 fatty acids, and in particular for the
conversion of palmitic acid (16:0) to stearic acid (18:0). The nucleotide sequence of
ELOVL6 is given in SEQ ID NO: 1 (mRNA sequence) and SEQ ID NO: 2 (mRNA -
coding sequence); the amino acid sequence is given in SEQ ID NO: 3. See also
Genbank accession numbers NM_024090.1 (gi 13129087) for the nucleotide
sequence and NP_076995.1 (gi 13129088) for the amino acid sequence. Reference is
also made to Moon et al., supra. By (mRNA sequence) it is meant that the nucleotide
sequences listed are derived from the full mRNA sequence (including introns). By
(mRNA - coding sequence) it is meant that the nucleotide sequences listed are derived
from the mRNA sequence which encodes the amino acid sequence, that is with
introns removed.

- ELOVL3, which is the human ortholog of the mouse elongase Cig30 and the S.
cerevisiae elongase ELO2, with specificity for very long chain fatty acids (up to C26).
The nucleotide sequence of ELOVL3 is given in SEQ ID NO: 6 (mRNA sequence)
and SEQ ID NO: 7 (mRNA - coding sequence); the amino acid sequence is given in
SEQ ID NO: 8. See also Genbank accession numbers NM_152310 (gi 23097309;
nucleotide sequence) and NP_689523.1 (gi 23097310; amino acid sequence).
Reference is also made to Tvrdik (1997), supra, and to Moon et al., supra.

- ELOVL5 (also called "HELO1"), which is involved in the elongation of
polyunsaturated fatty acids (PUFA's). The nucleotide sequence of ELOVL5 is given
in SEQ ID NO: 9 (mRNA sequence) and SEQ ID NO: 10 (mRNA - coding
sequence); the amino acid sequence is given in SEQ ID NO: 11. See also Genbank
accession numbers NM_021814 (gi 21361903) for the nucleotide sequence and
NP_068586.1 (gi 11464975) for the amino acid sequence.
Reference is also made to Leonard et al., supra.

- ELOVL4, which is involved in the elongation of very long chain fatty acids. The
nucleotide sequence of ELOVL4 is given in SEQ ID NO: 12 (mRNA sequence) and
SEQ ID NO: 13 (mRNA - coding sequence); the amino acid sequence is given in
SEQ ID NO: 14. See also Genbank accession numbers NM_022726 (gi 21362099)
for the nucleotide sequence and NP_073563.1 (gi 12232379) for the amino acid
sequence.

- ELOVL2, which is the human ortholog of the mouse elongase Ssc2, which has been
shown to accept arachidonic acid and eicosapentaenoic acid as its substrates. The
nucleotide sequence of ELOVL2 is given in SEQ ID NO: 15 (mRNA sequence) and
SEQ ID NO: 16 (mRNA - coding sequence); the amino acid sequence is given in
SEQ ID NO: 17. See also Genbank accession numbers NM_17770 (gi 8923311) for
the nucleotide sequence and NP_060240.1 (gi 8923312) for the amino acid sequence.
Reference is also made to Tvrdik (2000), supra, and to Moon et al., supra.

- ELOVL1, which is the human ortholog of the mouse elongase Ssc1 and the S.
cerevisiae elongase ELO3, with specificity for very long chain fatty acids. The
nucleotide sequence of ELOVL1 is given in SEQ ID NO: 18 (mRNA sequence) and
SEQ ID NO: 19 (mRNA - coding sequence); the amino acid sequence is given in
SEQ ID NO: 20. See also Genbank accession numbers NM_022821 (gi 13489092) for
the nucleotide sequence and NP_073732.1 (gi 13489093) for the amino acid
sequence. Reference is also made to Tvrdik (2000), supra, and to Moon et al., supra.

In general, all the condensation enzymes described above contain a histidine-rich
motif (HXXHH), and also contain an ELO domain (as determined by SMART™ analysis
(see http://smart.embl-heidelberg.de/) and/or PFAM protein search™ (see for example
http://pfam.wustl.edu/)). For example, the ELO domain in the amino acid sequence of
SEQ ID NO: 3 comprises amino acids 10-265.
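
As a minimal sketch of how the histidine-rich motif mentioned above could be located in an amino acid sequence, the following Python snippet scans a one-letter-code sequence for the pattern H-X-X-H-H. Reading the motif with X as "any residue" is an assumption about the notation, the function name is hypothetical, and the toy sequence is invented; it is not one of the SEQ ID NOs. Domain assignment (the ELO domain) would still require a dedicated tool such as those cited above.

```python
import re
from typing import List, Tuple

# H-X-X-H-H with X = any residue; the lookahead also reports overlapping hits.
HXXHH = re.compile(r"(?=(H..HH))")


def find_hxxhh(sequence: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, match) triples with 1-based, inclusive residue numbers."""
    seq = sequence.upper()
    return [(m.start() + 1, m.start() + 5, m.group(1)) for m in HXXHH.finditer(seq)]


# Made-up toy sequence for illustration only.
toy = "MKTLLFWVFHVYHHASVLLAAW"
print(find_hxxhh(toy))  # [(10, 14, 'HVYHH')]
```

Reporting 1-based inclusive positions matches the way residue ranges are quoted in the text (for example "amino acids 24-28").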

In addition to the six enzymes from the ELOVL family above, bio-informatic
analysis has also identified a further human amino acid sequence - shown in SEQ ID
NO: 5 - that contains both an HXXHH motif (amino acids 24-28) as well as an "ELO"
domain on PFAM protein search™ (amino acids 1-94). Although the invention is not
limited to any specific hypothesis or explanation, it is assumed that this amino acid
sequence is also involved in the elongation of fatty acids, and in particular in the
condensation between a fatty acyl-CoA and malonyl-CoA to form 3-ketoacyl-CoA (i.e.
the First Step above). The nucleotide sequence for this hypothetical protein (partial CDS)
is given in SEQ ID NO: 4 and the amino acid sequence is given in SEQ ID NO: 5.
Reference is also made to Genbank accession number AL137506 (gi: 6808153) for the
nucleotide sequence, and CAB70777.1 (gi: 6808154) for the amino acid sequence. These
sequences and their use in the methods described herein form further aspects of the
invention.

Compared to the condensation enzymes involved in the First Step, it is assumed
that the subsequent steps of reduction (the Second Step above), dehydration (the Third
Step above) and reduction (the Fourth Step above) are performed by enzymes that have
less substrate specificity than the condensation enzymes described above.

The human reductases that are involved in the Second Step and Fourth Step are
described by Moon and Horton, supra:
- "3-ketoacyl-CoA reductase" or "KAR", which is involved in the reduction of 3-
ketoacyl-CoA using NADPH to form 3-hydroxyacyl-CoA (the Second Step above).
The nucleotide sequence of KAR is given in SEQ ID NO: 21 (mRNA sequence) and
SEQ ID NO: 22 (mRNA - coding sequence); the amino acid sequence is given in
SEQ ID NO: 23. See also Genbank accession numbers NM_016142 (gi 7705854)
for the nucleotide sequence and NP_057226.1 (gi 7705855) for the amino acid
sequence. KAR has also been described in the art under the name "17-beta-
hydroxysteroid dehydrogenase" or "HSD17B12".
- "trans-2,3-enoyl-CoA reductase" or "TER", which is involved in the reduction of
trans-2,3-enoyl-CoA to saturated acyl-CoA (the Fourth Step above). The nucleotide
sequence of TER is given in SEQ ID NO: 24 (mRNA sequence) and SEQ ID NO: 25
(mRNA - coding sequence); the amino acid sequence is given in SEQ ID NO: 26. See
also Genbank accession numbers NM_004868 (gi 4759061) for the nucleotide
sequence and NP_004859.1 (gi 4759062) for the amino acid sequence. TER has also
been described in the art under the names "synaptic glycoprotein 2" or "GPSN2", and

KAR (and its homolog NM_031463 of SEQ ID NO: 33, which has been found by
bio-informatic analysis, see the Examples below) both contain an "adh_short" domain, as
determined by PFAM analysis (amino acid residues 49-280 for KAR and amino acid
residues 66-306 for NM_031463). TER contains a "steroid_short" domain, again as
determined by PFAM analysis (amino acid residues 12-157).

For the sake of completeness, it is mentioned herein that the desaturation of fatty
acids (both saturated and unsaturated) is performed by enzymes which are referred to as
"fatty acid desaturases". These inter alia include, but are not limited to, the enzymes
known as the "stearoyl-CoA-desaturases" or "SCD's". Examples of SCD's are SCD-1
and SCD-5 from humans, which are inter alia described in the International applications
WO 01/66758 and WO 02/26944, respectively. Reference is also made to Ntambi et al.,
Curr. Opin. Lipidol. 14:255-261 (2003) and the further references therein. However, such
desaturases fall outside the scope of the present invention.

It has now been found that amino acid sequences involved in the elongation of fatty acids
can be used as targets (as described herein) in the field of metabolic disease.

In particular, it has been found that amino acid sequences that are involved in one
of the four steps of the elongation cycle of fatty acids - i.e. the First Step, Second Step,
Third Step and Fourth Step as described above - can be used as targets in metabolic
disease, as further described hereinbelow. Throughout the specification "amino acid
sequence" is defined to include any peptide, polypeptide, protein or enzyme. Also
included within the definition are fragments of any of the above which retain
functionality in terms of elongation of fatty acids.

According to one specific, but non-limiting embodiment of the invention, the
amino acid sequence that is used in the invention is derived from multicellular animals, in
particular from vertebrate animals, more in particular mammals, and even more in
particular from humans, as further described hereinbelow.

According to one specific, but non-limiting aspect of the invention, the amino
acid sequence that is used in the invention is chosen from the elongases, as described
above, and more in particular from the elongases that are involved in the First Step, Second
Step and/or Third Step above.

According to one specific, but non-limiting aspect of the invention, the amino 
acid sequence used in the invention is chosen from elongases involved in the First Step 
above, i.e. from the condensation enzymes involved in the condensation between a fatty
acyl-CoA and malonyl-CoA to form 3-ketoacyl-CoA. 

More in particular, according to this aspect of the invention, the amino acid 
sequence used in the invention is chosen from condensation enzymes involved in the
First Step above that have specificity (as determined by a suitable assay, such as one of 
the assays mentioned hereinabove) for fatty acids with a length of the carbon chain of 24 
carbon atoms or less (such as palmitic acid, stearic acid, linoleic acid and/or linolenic 
acid), compared to fatty acids with a length of the carbon chain of more than 24 carbon 
atoms. 

Even more in particular, according to this aspect of the invention, the amino acid 
sequence used in the invention is chosen from condensation enzymes involved in the 
First Step above that have specificity (as determined by a suitable assay, such as one of 
the assays mentioned hereinabove) for fatty acids with a length of the carbon chain of 20 
carbon atoms or less (such as palmitic acid, stearic acid, linoleic acid and/or linolenic 
acid), compared to fatty acids with a length of the carbon chain of more than 20 carbon
atoms.

Still more in particular, according to this aspect of the invention, the amino acid
sequence used in the invention is chosen from condensation enzymes involved in the
First Step above that have specificity (as determined by a suitable assay, such as one of
the assays mentioned hereinabove) for fatty acids with a length of the carbon chain of 18
carbon atoms or less, compared to fatty acids with a length of the carbon chain of more
than 18 carbon atoms; and specifically for fatty acids with a length of the carbon chain of
16 or 18 carbon atoms, compared to fatty acids with a length of the carbon chain of more
than 18 carbon atoms.

Even still more in particular, according to this aspect of the invention, the amino 
acid sequence used in the invention is chosen from condensation enzymes involved in the 
First Step above that have specificity (as determined by a suitable assay, such as one of 
the assays mentioned hereinabove) for fatty acids with a length of the carbon chain of 16 
carbon atoms, compared to fatty acids with a length of the carbon chain of more than 16
carbon atoms (such as stearic acid). Some preferred, but non-limiting examples of such 
enzymes from mammals are LCE from mice (see Moon et al., supra) and its human 
equivalent (see again Moon et al., supra, as well as SEQ ID NOS: 1, 2 and 3). 

Herein, the "substrate specificity" or "specificity" of an elongase can be 
determined using a suitable assay in a manner known per se, as will be clear to the skilled 
person. Generally, this will involve comparing two relevant substrates in such an assay,
and determining which substrate is better accepted (e.g. more easily and/or faster 
converted) by the enzyme involved. For example, in the case of a condensation enzyme 
involved in the First Step above, the substrates may be tested in one of the in vivo or in 
vitro assays mentioned above and/or hereinbelow. 

Generally, using such a suitable assay, the substrate specificity can be 
determined/expressed by means of the "turnover" of the substrate(s) involved (i.e. the 
amount of substrate converted by the enzyme within a given period of time, usually 
measured at a pre-determined concentration), with a higher turnover of a first substrate 
compared to a second substrate being an indication that the first substrate is better
accepted by the enzyme involved than the second substrate. Alternatively, the substrate
specificity can be expressed using the well-known Km value, with a lower Km value of a
first substrate compared to a second substrate being an indication that the first substrate is
better accepted by the enzyme involved than the second substrate.

Thus, according to one particular non-limiting aspect, an enzyme is considered to
have "specificity" for a first substrate compared to a second substrate if, in a suitable
assay, either (1) the turnover of the first substrate is at least 1.1 times, preferably at least
1.5 times, more preferably at least 2.5 times, even more preferably at least 5 times, and in
particular 10 times or more the turnover for the second substrate (as measured in the
same assay using essentially the same conditions); and/or (2) the Km value of the first
substrate is less than 90%, preferably less than 50%, more preferably less than 25%, even
more preferably less than 10% of the Km value for the second substrate (as measured in
the same assay using essentially the same conditions).
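
The two criteria above lend themselves to a short worked example. The Python sketch below takes the thresholds directly from the preceding paragraph and reports the most stringent level that a pair of measurements satisfies. The function names are hypothetical, the input values are invented, and both measurements are assumed to be positive and obtained in the same assay under essentially the same conditions.

```python
from typing import Optional

# Thresholds quoted in the paragraph above, from most to least stringent.
TURNOVER_TIERS = (10.0, 5.0, 2.5, 1.5, 1.1)  # turnover(first) / turnover(second), "at least"
KM_TIERS = (0.10, 0.25, 0.50, 0.90)          # Km(first) / Km(second), "less than"


def turnover_tier(turnover_first: float, turnover_second: float) -> Optional[float]:
    """Most stringent turnover threshold met, or None if criterion (1) is not met."""
    ratio = turnover_first / turnover_second
    return next((t for t in TURNOVER_TIERS if ratio >= t), None)


def km_tier(km_first: float, km_second: float) -> Optional[float]:
    """Most stringent Km threshold met, or None if criterion (2) is not met."""
    ratio = km_first / km_second
    return next((t for t in KM_TIERS if ratio < t), None)


def has_specificity(turnover_first=None, turnover_second=None,
                    km_first=None, km_second=None) -> bool:
    """True if criterion (1) and/or criterion (2) is satisfied."""
    by_turnover = (turnover_first is not None and turnover_second is not None
                   and turnover_tier(turnover_first, turnover_second) is not None)
    by_km = (km_first is not None and km_second is not None
             and km_tier(km_first, km_second) is not None)
    return by_turnover or by_km


# Invented numbers, in arbitrary but identical units for both substrates.
print(turnover_tier(30.0, 10.0))                       # 2.5
print(km_tier(4.0, 20.0))                              # 0.25
print(has_specificity(km_first=19.0, km_second=20.0))  # False
```

In the first call the turnover ratio is 3, which meets the "at least 2.5 times" level but not the "at least 5 times" level; in the last call the Km ratio is 95%, which does not fall below the 90% limit.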

According to one non-limiting embodiment of the aspect of the invention, the use
of an amino acid sequence that is involved in - e.g. has specificity for - the elongation of
unsaturated fatty acids that cannot be synthesized de novo by animals, or that are derived
from fatty acid precursors that cannot be synthesized de novo by animals (i.e. by
elongation and/or further desaturation), is less preferred.

In particular, according to this non-limiting embodiment of the aspect of the
invention, the use of amino acid sequence(s) that are involved in the elongation of
unsaturated fatty acids of the (n-3) family or (n-6) family is less preferred.

More in particular, the use of amino acid sequence(s) that are involved in the
elongation of unsaturated fatty acids of the (n-3) family or (n-6) family with a length of
the carbon chain of less than 20 carbon atoms (such as linoleic acid and alpha-linolenic
acid) is less preferred.

Thus, according to one non-limiting embodiment of this aspect of the invention, 
the amino acid sequence(s) used as targets in the invention is/are chosen from
condensation enzymes involved in the First Step above that have specificity (as 
determined by a suitable assay, such as one of the assays mentioned hereinabove) for 
saturated fatty acids compared to unsaturated fatty acids of the (n-3) family and/or (n-6) 
family. 

According to one particularly preferred, but non-limiting embodiment of this
aspect of the invention, the amino acid sequence used in the invention is chosen from
condensation enzymes involved in the First Step above that have specificity (as
determined by a suitable assay, such as one of the assays mentioned hereinabove) for
saturated fatty acids with 24 carbon atoms or less in the fatty acid chain (such as palmitic
acid and stearic acid), compared to both (1) saturated and unsaturated fatty acids with
more than 24 carbon atoms in the fatty acid chain; as well as (2) unsaturated fatty acids of
the (n-3) family and/or (n-6) family with 20 carbon atoms or less in the fatty acid chain
(such as linoleic acid and alpha-linolenic acid).

According to one particularly preferred, but non-limiting embodiment of this
aspect of the invention, the amino acid sequence used in the invention is chosen from
condensation enzymes involved in the First Step above that have specificity (as
determined by a suitable assay, such as one of the assays mentioned hereinabove) for
saturated fatty acids with 20 carbon atoms or less in the fatty acid chain (such as palmitic
acid and stearic acid), compared to both (1) saturated and unsaturated fatty acids with
more than 20 carbon atoms in the fatty acid chain; as well as (2) unsaturated fatty acids of
the (n-3) family and/or (n-6) family with 20 carbon atoms or less in the fatty acid chain
(such as linoleic acid and alpha-linolenic acid).

According to one particularly preferred, but non-limiting embodiment of this
aspect of the invention, the amino acid sequence used in the invention is chosen from
condensation enzymes involved in the First Step above that have specificity (as
determined by a suitable assay, such as one of the assays mentioned hereinabove) for
saturated fatty acids with 18 carbon atoms or less in the fatty acid chain, compared to
both (1) saturated and unsaturated fatty acids with more than 18 carbon atoms in the fatty
acid chain; as well as (2) unsaturated fatty acids of the (n-3) family and/or (n-6) family
with 18 carbon atoms or less in the fatty acid chain.

More specifically, according to this preferred, but non-limiting embodiment, the
amino acid sequence used in the invention is chosen from condensation enzymes
involved in the First Step above that have specificity (as determined by a suitable assay,
such as one of the assays mentioned hereinabove) for palmitic acid and/or for stearic
acid, compared to both (1) saturated and unsaturated fatty acids with more than 18 carbon
atoms in the fatty acid chain; as well as (2) linoleic acid and/or alpha-linolenic acid.




Even more specifically, according to this preferred, but non-limiting embodiment,
the amino acid sequence used in the invention is chosen from condensation enzymes
involved in the First Step above that have specificity (as determined by a suitable assay,
such as one of the assays mentioned hereinabove) for palmitic acid, compared to both (1)
stearic acid; as well as (2) linoleic acid and/or alpha-linolenic acid. Preferred, but non-
limiting examples of such enzymes from mammals are LCE from mice (see Moon et al.,
supra) and its human equivalent (see again Moon et al., supra, as well as SEQ ID NOS:
1, 2 and 3), of which the latter is particularly preferred.

According to one specific, but non-limiting aspect of the invention, the amino
acid sequence used in the invention is chosen from reductases involved in the reduction
of the 3-ketoacyl-CoA using NADPH to form 3-hydroxyacyl-CoA (i.e. the Second Step
above) and/or in the reduction of trans-2,3-enoyl-CoA to saturated acyl-CoA (i.e. the
Fourth Step above). Some preferred, but non-limiting examples of such enzymes are
KAR and TER, of which KAR is particularly preferred.

In one more specific embodiment, the amino acid sequence used in the invention
may be chosen from the group consisting of ELOVL1, ELOVL2, ELOVL3, ELOVL4,
ELOVL5, ELOVL6, KAR and TER, and in particular from ELOVL6, KAR and TER.

Thus, according to a first aspect, the invention relates to a method for identifying
a compound that can be used in (the preparation of a pharmaceutical composition for) the
prevention and/or treatment of metabolic diseases (e.g. from a set or library of test
chemicals), said method at least comprising the steps of:

a) contacting an amino acid sequence that is involved in the elongation of fatty acids,
and/or a host cell or host organism containing/expressing such an amino acid
sequence, with a test chemical, in such a way that a signal may be generated that is
representative for the interaction between said test chemical and said amino acid
sequence; and optionally

b) detecting the signal that may thus be generated, said signal identifying a modulator of
said amino acid sequence;

in which the modulator thus identified can be used in (the preparation of a pharmaceutical
composition for) the prevention and/or treatment of metabolic diseases.

For example, the modulators thus identified can be used in (the preparation of a
pharmaceutical composition for) the prevention and/or treatment of metabolic diseases,
and/or can be used to develop other compounds that can be used in (the preparation of a
pharmaceutical composition for) the prevention and/or treatment of metabolic diseases,
as further described below.

According to a further aspect, the invention relates to a method for generating a
signal that is representative for the interaction of a test chemical with an amino acid
sequence involved in the elongation of fatty acids, said method at least comprising the
steps of:

a) contacting said amino acid sequence, or a host cell or host organism
containing/expressing said amino acid sequence involved in the elongation of fatty
acids, with said test chemical, in such a way that a signal may be generated that is
representative for the interaction between said test chemical and said amino acid
sequence; and optionally

b) detecting the signal that may thus be generated.

In another aspect, the invention relates to a method for identifying a modulator of
an amino acid sequence that is involved in the elongation of fatty acids (e.g. from a set or
library of test chemicals), said method at least comprising the steps of:

a) contacting said amino acid sequence, or a host cell or host organism
containing/expressing said amino acid sequence, with a test chemical, in such a way
that a signal may be generated that is representative for the interaction between said
test chemical and said amino acid sequence; and optionally

b) detecting the signal that may thus be generated,

wherein a change in said signal (compared to a suitable reference) identifies a modulator
of said amino acid sequence.
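
As a purely illustrative sketch of step b) and of the comparison with a suitable reference, the snippet below flags test chemicals whose detected signal deviates from a reference signal. The function name `identify_modulators`, the compound labels and the 50% relative-change cut-off are assumptions made only for this example; the description does not prescribe any particular threshold, read-out or data format.

```python
def identify_modulators(signals, reference_signal, min_relative_change=0.5):
    """Return {test chemical: relative change} for signals that differ from the reference.

    signals             -- mapping of test-chemical label to detected signal (hypothetical data)
    reference_signal    -- signal measured without test chemical (the "suitable reference")
    min_relative_change -- assumed cut-off; not taken from the description
    """
    if reference_signal <= 0:
        raise ValueError("reference signal must be positive")
    hits = {}
    for chemical, signal in signals.items():
        # Relative change: negative values suggest inhibition, positive values activation.
        change = (signal - reference_signal) / reference_signal
        if abs(change) >= min_relative_change:
            hits[chemical] = change
    return hits


# Hypothetical read-outs from a screen of three test chemicals.
screen = {"cmpd_A": 100.0, "cmpd_B": 40.0, "cmpd_C": 180.0}
hits = identify_modulators(screen, reference_signal=100.0)
```

In this made-up data set, `cmpd_B` and `cmpd_C` would be flagged, while `cmpd_A` gives the same signal as the reference and is not.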

The "amino acid sequence involved in elongation of fatty acids" may be any
amino acid sequence that is involved in - e.g. catalyses - the First Step, the Second Step,
the Third Step and/or the Fourth Step above in any unicellular or multicellular organism,
in particular in yeast or in a multicellular animal, more in particular in a vertebrate
animal, even more in particular in a mammal, and specifically in a human.

As indicated above, the amino acid sequence involved in elongation of fatty acids
is preferably chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and 
malonyl-CoA to form 3-ketoacyl-CoA; and 

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to 
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated 
acyl-CoA. 

More preferably, the amino acid sequence involved in elongation of fatty acids is 
chosen from the group consisting of: 

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity (as determined by a 
suitable assay, such as one of the assays mentioned hereinabove) for fatty acids with a 
length of the carbon chain of 24 carbon atoms or less; and 

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated 
acyl-CoA. 

Even more preferably, the amino acid sequence involved in elongation of fatty 
acids is chosen from the group consisting of: 

- condensation enzymes involved in the condensation between a fatty acyl-CoA and 
malonyl-CoA to form 3-ketoacyl-CoA that have specificity (as determined by a 
suitable assay, such as one of the assays mentioned hereinabove) for fatty acids with a 
length of the carbon chain of 20 carbon atoms or less; and 

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to 
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated 
acyl-CoA. 

Still more preferably, the amino acid sequence involved in elongation of fatty 
acids is chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and 
malonyl-CoA to form 3-ketoacyl-CoA that have specificity (as determined by a 
suitable assay, such as one of the assays mentioned hereinabove) for fatty acids with a 
length of the carbon chain of 18 carbon atoms or less; and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated 
acyl-CoA. 

In particular, the amino acid sequence involved in elongation of fatty acids may 
be chosen from the group consisting of: 

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity (as determined by a 
suitable assay, such as one of the assays mentioned hereinabove) for saturated fatty 
acids with 18 carbon atoms or less in the fatty acid chain, compared to both (1) 
saturated and unsaturated fatty acids with more than 18 carbon atoms in the fatty acid 
chain; as well as (2) unsaturated fatty acids of the (n-3) family and/or (n-6) family 
with 18 carbon atoms or less in the fatty acid chain; and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated 
acyl-CoA. 

According to a specific non-limiting aspect, the amino acid sequence involved in 
elongation of fatty acids may be chosen from the group consisting of: 

- condensation enzymes involved in the condensation between a fatty acyl-CoA and 
malonyl-CoA to form 3-ketoacyl-CoA that have specificity (as determined by a 
suitable assay, such as one of the assays mentioned hereinabove) for palmitic acid 
and/or for stearic acid, compared to both (1) saturated and unsaturated fatty acids with
more than 18 carbon atoms in the fatty acid chain; as well as (2) linoleic acid and/or 
alpha-linolenic acid; and 

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA. 

According to a very specific non-limiting aspect, the amino acid sequence 
involved in elongation of fatty acids may be chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity (as determined by a
suitable assay, such as one of the assays mentioned hereinabove) for palmitic acid,
compared to both (1) stearic acid; as well as (2) linoleic acid and alpha-linolenic acid;
and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA.

According to one specific, but non-limiting embodiment, the amino acid sequence
corresponding to a condensation enzyme contains at least a histidine-rich motif
(HXXHH), and preferably also contains an ELO-domain (as determined by SMART™
analysis (see http://smart.embl-heidelberg.de/) and/or preferably by PFAM protein
search™ (see for example http://pfam.wustl.edu/)).
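
By way of illustration only, the presence of the histidine-rich HXXHH motif in a candidate sequence could be checked with a simple pattern search, reading X as any amino acid residue in one-letter code. The function name `find_hxxhh_motifs` and the example sequence are hypothetical and are not taken from the sequence listing; detecting an ELO domain would still require a profile-based tool such as SMART or PFAM, which this sketch does not attempt.

```python
import re

# A lookahead is used so that overlapping occurrences of the motif are also reported.
HXXHH_PATTERN = re.compile(r"(?=(H[A-Z]{2}HH))")


def find_hxxhh_motifs(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched residues) for every HXXHH occurrence."""
    seq = sequence.upper()
    return [(match.start() + 1, match.group(1)) for match in HXXHH_PATTERN.finditer(seq)]


# Made-up peptide used only to exercise the function.
example_sequence = "MAAHVWHHLLL"
motifs = find_hxxhh_motifs(example_sequence)
```

For the made-up peptide above, a single occurrence (HVWHH, starting at residue 4) is reported.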

According to one specific, but non-limiting embodiment, the amino acid sequence
corresponding to the reductase involved in the Second Step preferably contains an
"adh_short" domain (as determined by SMART™ analysis (see
http://smart.embl-heidelberg.de/) and/or preferably by PFAM protein search™ (see for
example http://pfam.wustl.edu/)).

According to one specific, but non-limiting embodiment, the amino acid sequence
corresponding to the reductase involved in the Fourth Step preferably contains a
"steriod_short" domain (as determined by SMART™ analysis (see
http://smart.embl-heidelberg.de/) and/or preferably by PFAM protein search™ (see for
example http://pfam.wustl.edu/)).

According to a more specific but non-limiting embodiment, the amino acid
involved in the elongation of fatty acids may be chosen from the group consisting of:
ELO1, ELO2, ELO3, Cig30, Ssc1, Ssc2, LCE, ELOVL1, ELOVL2, ELOVL3, ELOVL4,
ELOVL5 (HELO1), ELOVL6, KAR and TER, and natural or synthetic analogs thereof,
as further defined below.

In particular, the amino acid involved in the elongation of fatty acids may be
chosen from the group consisting of: Cig30, Ssc1, Ssc2, LCE, ELOVL1, ELOVL2,
ELOVL3, ELOVL4, ELOVL5 (HELO1), ELOVL6, KAR and TER, and natural or
synthetic analogs thereof (as further defined below).

More in particular, the amino acid involved in the elongation of fatty acids may
be chosen from the group consisting of: LCE, ELOVL6, KAR and TER, and natural or
synthetic analogs thereof (as further defined below).

Even more in particular, the amino acid involved in the elongation of fatty acids
may be chosen from the group consisting of: ELOVL6, KAR and TER, and natural or
synthetic analogs thereof (as further defined below).

Specifically, the amino acid sequence involved in the elongation of fatty acids
may be chosen from the group consisting of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26
and 33, and natural or synthetic analogs thereof (as further defined below).

More specifically, the amino acid sequence involved in the elongation of fatty
acids may be chosen from the group consisting of: SEQ ID NO: 3, 23 and/or 26, and
natural or synthetic analogs thereof (as further defined below).

In further aspects, the invention also relates to the use of an amino acid sequence
that is involved in the elongation of fatty acids (as defined above), and/or a host cell/host
organism that contains and/or expresses such an amino acid sequence, in one of
the methods described above. For the purposes herein, the term "host cell" also comprises
cell fractions and/or preparations derived from such cells, such as cytosolic and in
particular microsomal preparations (as mentioned below).

The invention also relates to compounds that can modulate (the biological activity 
of), and/or that can otherwise interact with, an amino acid that is involved in the 
elongation of fatty acids. The invention also relates to compositions that contain such 
compounds, and in particular to pharmaceutical compositions that contain such 
compounds. 

The invention further relates to the use of compounds that can modulate (the 
biological activity of), and/or that can otherwise interact with, an amino acid sequence 
that is involved in the elongation of fatty acids, in the preparation of pharmaceutical 
compositions, and in particular to the use of such compounds in the preparation of a 
pharmaceutical composition for the prevention and/or treatment of metabolic diseases. 

The invention also relates to compounds that can be used in the prevention and/or
treatment of metabolic diseases (as further defined below), which compounds have or can
be identified and/or developed using an amino acid sequence involved in the elongation
of fatty acids and/or a host cell or host organism that contains and/or expresses such an
amino acid sequence. The invention also relates to compositions that contain such
compounds, and in particular to pharmaceutical compositions that contain said
compounds, and to the use of such compounds in the preparation of a pharmaceutical
composition, and in particular to the use of such compounds in the preparation of a
pharmaceutical composition for the prevention or treatment of metabolic diseases.

Unless explicitly specified herein, all terms used in the present description have
their usual meaning in the art, for which particular reference is made to the definitions
given in WO98/06737 and EP 1 085 089.

The amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33
were identified, and can be derived/isolated from/using human cells; in the manner as
further described in the prior art referred to above, or in any other suitable manner known
per se.

Generally, the amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23,
26 and 33 may be isolated from the human cells and/or tissues using any technique(s) for
protein isolation and/or purification known per se. Alternatively, the amino acid
sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33 may be obtained by
suitable expression of a suitable nucleotide sequence - such as one of the nucleotide
sequences of SEQ ID NOS: 1, 2, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 21, 22, 24 or 25, as
applicable, or a suitable mutant thereof - in an appropriate host cell or host organism, as
further described below.

Also, it is expected that - based upon the disclosure herein - the skilled person will
be able to identify, derive and/or isolate natural "analogs" (as mentioned above) of the
amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33. Such
mutants could be derived from other human individuals and/or from (individuals of) other
species. For example, such natural analogs may be obtained from yeast or from species of
multicellular animals such as Drosophila melanogaster or Caenorhabditis elegans, and in
particular from other species of vertebrate animals, more in particular of mammals, such
as rat, mouse, pig, sheep, rabbit, cow, horse or dog.

Such natural analogs may again be obtained by isolating them from their natural
source using any technique(s) for protein isolation and/or purification known per se, or
alternatively by suitable expression of a suitable nucleotide sequence - such as a natural
"mutant" as described herein - in an appropriate host cell or host organism, as further
described below.

It is also expected that - based upon the disclosure herein - the skilled person will
be able to provide and/or derive synthetic "analogs" (as mentioned above) of one or
more of the amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33.

Generally, such synthetic analogs may be obtained by suitable expression of a 
suitable nucleotide sequence used in the invention - such as a synthetic mutant as 
described above - in an appropriate host cell or host organism, as further described below. 

Preferably, any analogs as described herein will have one or more, and preferably
all, of the structural characteristics/conserved features referred to above for the sequences
of SEQ ID NO: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33, such as - in the case of a
condensation enzyme involved in the First Step above - the HXXHH motif and/or the
ELO domain, and - in the case of a reductase involved in the Second Step above - an
adh_short domain.

It is also possible in the invention to use a part or fragment of one or more of the
amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33, or a part or
fragment of a (natural or synthetic) analog thereof. This may for instance be an N- and/or
C-truncated amino acid sequence. Also, two or more parts or fragments of one or more
amino acid sequences involved in the elongation of fatty acids may be suitably combined
to provide a (further) amino acid sequence for use in the invention.

Preferably, any such parts or fragments will be such that they comprise at least
one continuous stretch of at least 5 amino acids, preferably at least 10 amino acids, more
preferably at least 20 amino acids, even more preferably more than 30 amino acids, of
one or more of the amino acid sequences of SEQ ID NO: 3, 5, 8, 11, 14, 17, 20, 23, 26
and 33.
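
As a minimal sketch of how the "continuous stretch" criterion might be checked, the snippet below measures the longest run of residues that a candidate part or fragment shares with a reference sequence, using Python's standard `difflib` module. The helper names `longest_shared_stretch` and `meets_minimum_stretch`, and both example sequences, are hypothetical; an actual check would use the relevant sequence from the sequence listing as the reference.

```python
from difflib import SequenceMatcher


def longest_shared_stretch(fragment: str, reference: str) -> int:
    """Length of the longest contiguous run of residues present in both sequences."""
    matcher = SequenceMatcher(None, fragment, reference, autojunk=False)
    match = matcher.find_longest_match(0, len(fragment), 0, len(reference))
    return match.size


def meets_minimum_stretch(fragment: str, reference: str, minimum: int = 5) -> bool:
    """True if the fragment contains a continuous stretch of at least `minimum` residues."""
    return longest_shared_stretch(fragment, reference) >= minimum


# Made-up sequences: the fragment shares the run "GHIKLMNP" (8 residues) with the reference.
reference_seq = "ACDEFGHIKLMNPQRSTVWY"
fragment_seq = "WWGHIKLMNPAA"
shared = longest_shared_stretch(fragment_seq, reference_seq)
```

With these made-up sequences the fragment satisfies the "at least 5" level but not the "at least 10" level.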

In particular, any parts or fragments as described herein are such that they (at
least) comprise the active/catalytic site of the corresponding amino acid sequence
involved in the elongation of fatty acids and/or a binding domain of the corresponding
amino acid sequence involved in the elongation of fatty acids, such as - in the case of a
condensation enzyme involved in the First Step above - the HXXHH motif and/or (part
of) the ELO domain, and - in the case of a reductase involved in the Second Step above -
an adh_short domain. As will be clear to the skilled person, such parts or fragments may
find particular use in assay- and screening techniques (as generally described below)
and/or (when said part or fragment is provided in crystalline form) in X-ray
crystallography.

Generally, such parts or fragments of the amino acid sequences involved in the
elongation of fatty acids may be obtained by suitable expression of a suitable nucleotide
sequence used in the invention - such as a suitable part or fragment as described herein
for the nucleotide sequences used in the invention - in an appropriate host cell or host
organism, as further described below.

In addition and/or as an alternative to the methodology above, amino acid
sequences used in the invention may also be provided by (chemically and/or
enzymatically) modifying the side chain(s) of one or more amino acid residues of an
amino acid sequence of SEQ ID NO: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33 or a part,
fragment, (natural and/or synthetic) mutant, variant, allele, analog or ortholog thereof, for
example by one or more of the side chain modifications as described in WO 01/02560
and/or by incorporating (e.g. by insertion and/or substitution) one or more unnatural
amino acid residues, again as described in WO 01/02560.

Preferably, any analogs, parts and/or fragments as used herein will be such that
they have a degree of "sequence identity", at the amino acid level, with one or more of
the amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33 of at least
50%, preferably at least 60%, more preferably at least 70%, even more preferably at least
80%, and in particular more than 90% and up to 95% or more.

For this purpose, the percentage of "sequence identity" between a given amino
acid sequence and one of the amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17,
20, 23, 26 and 33 may be calculated by dividing [the number of amino acid residues in
the given amino acid sequence that are identical to the amino acid residue at the
corresponding position in the amino acid sequence of the relevant SEQ ID NO] by [the
total number of amino acid residues in the given amino acid sequence] and multiplying
by [100%], in which each deletion, insertion, substitution or addition of an amino acid
residue - compared to the sequence of the relevant SEQ ID NO - is considered as a
difference at a single amino acid (position).
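
The quotient described above can be sketched as follows. This is an illustration under stated assumptions rather than a prescribed implementation: the two sequences are assumed to have been aligned beforehand (so that "corresponding positions" are simply matching columns), gaps are assumed to be written as `-`, and the function name `percent_identity` and the example sequences are hypothetical. Columns in which the given sequence has a gap contribute neither to the numerator nor to the denominator, which is one literal reading of the definition.

```python
def percent_identity(given_aligned: str, reference_aligned: str, gap: str = "-") -> float:
    """Percentage of residues in the given sequence identical to the reference residue
    at the corresponding (aligned) position."""
    if len(given_aligned) != len(reference_aligned):
        raise ValueError("aligned sequences must have the same length")
    # Numerator: aligned columns where the given residue equals the reference residue.
    identical = sum(
        1
        for given_res, ref_res in zip(given_aligned, reference_aligned)
        if given_res != gap and given_res == ref_res
    )
    # Denominator: total number of residues in the given sequence (gap symbols excluded).
    total = sum(1 for given_res in given_aligned if given_res != gap)
    if total == 0:
        raise ValueError("given sequence contains no residues")
    return 100.0 * identical / total


# Made-up example: one substitution in eight residues.
identity_substitution = percent_identity("MKTAYIAK", "MKTAHIAK")
# Made-up example: one inserted residue in the given sequence (gap in the reference).
identity_insertion = percent_identity("MKTAAYI", "MKT-AYI")
```

Here the single substitution gives 7 identical residues out of 8, and the single insertion gives 6 out of 7. The same function could be applied unchanged to aligned nucleotide sequences.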

Alternatively, the degree of sequence identity may be calculated using a known
computer algorithm, such as those mentioned above.

Also, such sequence identity at the amino acid level may take into account
so-called "conservative amino acid substitutions", which are well known in the art, for
example from GB-A-2 357 768, WO 98/49185, WO 00/46383 and WO 01/09300; and
(preferred) types and/or combinations of such substitutions may be selected on the basis
of the pertinent teachings from the references mentioned in WO 98/49185.

Also, preferably, any analogs, parts and/or fragments as used herein will have a
biological activity that is essentially similar to the biological activity described above for
the sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33, i.e. to a degree of at
least 10%, preferably at least 50%, more preferably at least 75%, and up to 90%, as
measured by a suitable assay method, for example those mentioned in the prior art cited
herein for each of these sequences.

Preferably, an amino acid sequence used in the invention will (also) have a length
(expressed as total number of amino acid residues), which is at least 50%, preferably at
least 60%, more preferably at least 70%, even more preferably at least 80%, and in
particular more than 90% and up to 95% or more of the length of one or more of the
amino acid sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33.

It is also within the scope of the invention to use a fusion of an amino acid
sequence involved in the elongation of fatty acids (as described above) with one or more
further amino acid sequences, for example to provide a protein fusion. Generally, such
fusions may be obtained by suitable expression of a suitable nucleotide sequence used in
the invention - such as a suitable fusion of a nucleotide sequence used in the invention
with one or more further coding sequences - in an appropriate host cell or host organism,
as further described below.

In one particular embodiment, such fusions may comprise an amino acid sequence
involved in the elongation of fatty acids fused with a reporter protein such as GFP,
luciferase or another fluorescent protein moiety. As will be clear to the skilled person,
such fusions may find particular use in expression analysis and similar methodologies.

In another embodiment, the fusion partner may be an amino acid sequence or
residue that may be used in purification of the expressed amino acid sequence, for
example using affinity techniques directed against said sequence or residue. Thereafter,
said sequence or residue may be removed (e.g. by chemical or enzymatic cleavage) to
provide the nucleotide sequence used in the invention (for this purpose, the sequence or
residue may optionally be linked to the amino acid sequence involved in the elongation of
fatty acids via a cleavable linker sequence). Some preferred, but non-limiting examples of
such residues are multiple histidine residues and glutathione residues.

In one preferred, but non-limiting aspect, any such fusion will have a biological
activity that is essentially similar to the biological activity described above for the
sequences of SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33, i.e. to a degree of at
least 10%, preferably at least 50%, more preferably at least 75%, and up to 90%, as
measured by a suitable assay method, for example those mentioned in the prior art cited
herein for each of these sequences.

As mentioned, the amino acid sequences involved in the elongation of fatty acids,
or natural or synthetic analogs thereof, may be obtained by suitable expression of a
nucleotide sequence encoding such an amino acid sequence in a suitable host organism,
as further described below. According to one embodiment, said nucleotide sequence may
be in the form of a genetic construct, also as further described below.

Some preferred, but non-limiting examples of nucleotide sequences that can be
used to express the amino acid sequences involved in the elongation of fatty acids
include, but are not limited to, the nucleotide sequences of SEQ ID NOS: 1, 2, 4, 6, 7, 9,
10, 12, 13, 15, 16, 18, 19, 21, 22, 24 or 25, as applicable, or a suitable part, mutant,
variant, allele, analog, ortholog, fusion or splice variant thereof (collectively referred to
herein as "mutants"), depending on the desired amino acid sequence.

The nucleotide sequences of SEQ ID NOS: 1, 2, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18,
19, 21, 22, 24 or 25 were identified, and can be derived/isolated from/using human cells
and/or tissues; in any suitable manner known per se, including but not limited to PCR
starting from human genomic DNA or a library of human cDNA, using primers designed
on the basis of the relevant sequence.

Also, it is expected that - based upon the disclosure herein - the skilled person will
be able to identify, derive and/or isolate natural "mutants" (as mentioned above) of the
above nucleotide sequences. For example, such mutants could be derived from other
species (in which case these mutants will also be referred to herein as "orthologs"). Some
examples of species from which such orthologs could be derived include, but are not
limited to species of

- unicellular and/or micro-organisms such as bacteria and yeast;
- invertebrate multicellular organisms such as insects and nematodes (for example,
agronomically harmful insect or nematode species);
- vertebrate multicellular organisms such as fish, birds, reptiles, amphibians and
mammals.

Preferably, a natural ortholog is derived from a mammal such as a mouse, rat,
rabbit or dog and the further species mentioned above.

Such natural mutants may be obtained in a manner essentially analogous to the
methods described in the prior art referred to above, or alternatively by:

- construction of a DNA library from the species of interest in an appropriate
expression vector system, followed by direct expression of the mutant sequence;
- construction of a DNA library from the species of interest in an appropriate
expression vector system, followed by screening of said library with a probe used in
the invention (as described below) and/or with a(nother) nucleotide sequence used in
the invention;
- isolation of mRNA that encodes the mutant sequence from the species of interest,
followed by cDNA synthesis using reverse transcriptase;

and/or by any other suitable method(s) or technique(s) known per se, for which reference
is for instance made to the standard handbooks, such as Sambrook et al., "Molecular
Cloning: A Laboratory Manual" (2nd ed.), Vols. 1-3, Cold Spring Harbor Laboratory
Press (1989) and F. Ausubel et al., eds., "Current protocols in molecular biology", Green
Publishing and Wiley Interscience, New York (1987).

It is also expected that - based upon the disclosure herein - the skilled person will
be able to provide and/or derive synthetic mutants (as defined hereinabove) of the above
nucleotide sequences.

Techniques for generating such synthetic sequences will be clear to the skilled
person and may for instance include, but are not limited to, automated DNA synthesis;
site-directed mutagenesis; combining two or more parts of one or more naturally
occurring sequences; introduction of mutations that lead to the expression of a truncated
expression product; introduction of one or more restriction sites (e.g. to create cassettes
and/or regions that may easily be digested and/or ligated using suitable restriction
enzymes); and/or the introduction of mutations by means of a PCR reaction using one or
more "mismatched" primers, using for example a sequence of a naturally occurring
GPCR as a template. These and other techniques will be clear to the skilled person, and
reference is again made to the standard handbooks, such as Sambrook et al. and Ausubel
et al., mentioned above.

Preferably, any mutants used will encode amino acid sequences having one or
more, and preferably all, of the structural characteristics/conserved features referred to
for the elongation enzymes above (such as - in the case of a condensation enzyme
involved in the First Step above - an HXXHH motif and/or an ELO domain, and - in the
case of a reductase involved in the Second Step above - an adh_short domain) and in
particular the active site(s) and/or catalytic domain(s).

According to one embodiment, the "mutants" used are (also) such that they are
capable of hybridizing with one or more of the nucleotide sequences of SEQ ID NOS: 1,
2, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 21, 22, 24 or 25, i.e. under conditions of "moderate
stringency", and preferably under conditions of "high stringency". Such conditions will
be clear to the skilled person, for example from the standard handbooks, such as
Sambrook et al. and Ausubel et al., mentioned above, as well as in EP 0 967 284, EP 1
085 089 or WO 00/55318; particular reference is made to the "stringent" hybridisation
conditions described in WO 00/78972 and WO 98/49185, and/or the hybridization
conditions described in GB 2 357 768-A.

Also, any "mutants" used will preferably have a degree of "sequence identity", at
the nucleotide level, with one or more of the nucleotide sequences of SEQ ID NOS: 1, 2,
4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 21, 22, 24 or 25 of at least 50%, preferably at least
60%, more preferably at least 70%, even more preferably at least 80%, and in particular
more than 90%, and up to 95% or more.

For this purpose, the percentage of "sequence identity" between a given
nucleotide sequence and one of the nucleotide sequences used in the invention may be
calculated by dividing [the number of nucleotides in the given nucleotide sequence that
are identical to the nucleotide at the corresponding position in the nucleotide sequence of
the relevant SEQ ID NO] by [the total number of nucleotides in the given nucleotide
sequence] and multiplying by [100%], in which each deletion, insertion, substitution or
addition of a nucleotide - compared to the sequence of the relevant SEQ ID NO - is
considered as a difference at a single nucleotide (position).

Alternatively, the degree of sequence identity may be calculated using a known 
computer algorithm for sequence alignment such as NCBI Blast v2.0, using standard 
settings. 

Some other techniques, computer algorithms and settings for determining the
degree of sequence identity are for example described in EP 0 967 284, EP 1 085 089,
WO 00/55318, WO 00/78972, WO 98/49185 and GB 2 357 768-A.

Generally, the nucleotide sequences used in the invention, when in the form of a
nucleic acid, may be DNA or RNA, and may be single stranded or double stranded. For
example, the nucleotide sequences used in the invention may be genomic DNA, cDNA or
synthetic DNA (such as DNA with a codon usage that has been specifically adapted for
expression in the intended host cell or host organism). Thus, the nucleotide sequences
used in the invention may contain intron sequences, and also generally comprise
different splice variants.

It is also within the scope of the invention to use a fusion of a nucleotide
sequence used in the invention (as described above) with one or more further nucleotide
sequence(s), including but not limited to one or more coding sequences, non-coding
sequences and/or regulatory sequences. Preferably, in such fusions, the one or more
further nucleotide sequences are operably connected (as described below) to the
nucleotide sequence used in the invention (for example so that, when the further
nucleotide sequence is a coding sequence, the nucleotide fusion encodes a protein fusion
as described below).

Another embodiment used in the invention relates to a nucleic acid probe that is
capable of hybridizing with a nucleotide sequence of SEQ ID NO: 1, 2, 4, 6, 7, 9, 10, 12,
13, 15, 16, 18, 19, 21, 22, 24 or 25 under conditions of moderate stringency, preferably
under conditions of high stringency, and in particular under stringent conditions (all as
described above). Such nucleotide probes may for instance be used for detecting and/or
isolating a(nother) nucleotide sequence used in the invention and/or as a primer for
amplifying a nucleotide sequence used in the invention; all using techniques known per
se, for which reference is again made to the general handbooks such as Sambrook et al.
and Ausubel et al. mentioned above.

Generally, such probes can be designed by the skilled person starting from a 
nucleotide sequence and/or amino acid sequence used in the invention - and in particular 
one or more of the sequences of SEQ ID NOS 1, 2, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 
21, 22, 24 or 25. 

The nucleic acids used in the invention may also be in the form of a genetic 
construct, which may for example comprise: 

a) the nucleotide sequence as described above; operably connected to: 

b) one or more regulatory elements, such as a promoter and optionally a suitable 
terminator; 

and optionally also: 

c) one or more further elements of genetic constructs known per se; 

in which the terms "regulatory element", "promoter", "terminator", "further elements"
and "operably connected" have the meanings indicated hereinbelow.

Such genetic constructs may be DNA or RNA, and are preferably double-stranded
DNA. The constructs may also be in a form suitable for transformation of the intended
host cell or host organism, in a form suitable for integration into the genomic DNA of the
intended host cell or in a form suitable for independent replication, maintenance and/or
inheritance in the intended host organism. For instance, the genetic construct may be in
the form of a vector, such as for example a plasmid, cosmid, YAC, a viral vector or
transposon. In particular, the vector may be an expression vector, i.e. a vector that can
provide for expression in vitro and/or in vivo (e.g. in a suitable host cell and/or host
organism as described below).

As the one or more "further elements" referred to above, the genetic construct(s)
used in the invention may generally contain one or more suitable regulatory elements
(such as a suitable promoter(s), enhancer(s), terminator(s), etc.), 3'- or 5'-UTR
sequences, leader sequences, selection markers, expression markers/reporter genes,
and/or elements that may facilitate or increase (the efficiency of) transformation or
integration. These and other suitable elements for such genetic constructs will be clear to
the skilled person, and may for instance depend upon the type of construct used; the
intended host cell or host organism; the manner in which the nucleotide sequences of
interest used in the invention are to be expressed (e.g. via constitutive, transient or
inducible expression); and/or the transformation technique to be used.

Preferably, in the genetic constructs used in the invention, the one or more further
elements are "operably linked" to the nucleotide sequence(s) used in the invention and/or
to each other, by which is generally meant that they are in a functional relationship with
each other. For instance, a promoter is considered "operably linked" to a coding sequence
if said promoter is able to initiate or otherwise control/regulate the transcription and/or
the expression of a coding sequence (in which said coding sequence should be
understood as being "under the control of" said promoter).

Generally, when two nucleotide sequences are operably linked, they will be in the 
same orientation and usually also in the same reading frame. They will usually also be 
essentially contiguous, although this may also not be required.

Preferably, the optional further elements of the genetic construct(s) used in the
invention are such that they are capable of providing their intended biological function in 
the intended host cell or host organism. 

For instance, a promoter, enhancer or terminator should be "operable" in the
intended host cell or host organism, by which is meant that (for example) said promoter
should be capable of initiating or otherwise controlling/regulating the transcription and/or 
the expression of a nucleotide sequence - e.g. a coding sequence - to which it is operably 
linked (as defined above). 

Such a promoter may be a constitutive promoter or an inducible promoter, and
may also be such that it (only) provides for expression in a specific stage of development
of the host cell or host organism, and/or such that it (only) provides for expression in a
specific cell, tissue, organ or part of a multicellular host organism.

Some particularly preferred promoters include, but are not limited to, those present
in the expression vectors referred to below.

A selection marker should be such that it allows - i.e. under appropriate selection
conditions - host cells and/or host organisms that have been (successfully) transformed
with the nucleotide sequence used in the invention to be distinguished from host
cells/organisms that have not been (successfully) transformed. Some preferred, but
non-limiting examples of such markers are genes that provide resistance against antibiotics
(such as kanamycin or ampicillin), genes that provide for temperature resistance, or
genes that allow the host cell or host organism to be maintained in the absence of certain
factors, compounds and/or (food) components in the medium that are essential for
survival of the non-transformed cells or organisms.

A leader sequence should be such that - in the intended host cell or host organism
- it allows for the desired post-translational modifications and/or such that it directs the
transcribed mRNA to a desired part or organelle of a cell. A leader sequence may also
allow for secretion of the expression product from said cell. As such, the leader sequence
may be any pro-, pre-, or prepro-sequence operable in the host cell or host organism.

An expression marker or reporter gene should be such that - in the host cell or
host organism - it allows for detection of the expression of (a gene or nucleotide sequence
present on) the genetic construct. An expression marker may optionally also allow for the
localisation of the expressed product, e.g. in a specific part or organelle of a cell and/or in
(a) specific cell(s), tissue(s), organ(s) or part(s) of a multicellular organism. Such reporter
genes may also be expressed as a protein fusion with the amino acid sequence involved in
the elongation of fatty acids. Some preferred, but non-limiting examples include
fluorescent proteins such as GFP.

For some (further) non-limiting examples of the promoters, selection markers,
leader sequences, expression markers and further elements that may be present/used in
the genetic constructs used in the invention - such as terminators, transcriptional and/or
translational enhancers and/or integration factors - reference is made to the general
handbooks such as Sambrook et al. and Ausubel et al., mentioned above, to W.B. Wood et
al., "The nematode Caenorhabditis elegans", Cold Spring Harbor Laboratory Press
(1988) and D.L. Riddle et al., "C. elegans II", Cold Spring Harbor Laboratory Press
(1997), as well as to the examples that are given in WO 95/07463, WO 96/23810, WO
95/07463, WO 95/21191, WO 97/11094, WO 97/42320, WO 98/06737, WO 98/21355,
US-A-6,207,410, US-A-5,693,492 and EP 1 085 089. Other examples will be clear to the
skilled person.

The genetic constructs used in the invention may generally be provided by 
suitably linking the nucleotide sequence(s) used in the invention to the one or more 
further elements described above, for example using the techniques described in the 
general handbooks such as Sambrook et al. and Ausubel et al., mentioned above.

Often, the genetic constructs used in the invention will be obtained by inserting a 
nucleotide sequence used in the invention in a suitable (expression) vector known per se. 
Some preferred, but non-limiting examples of suitable expression vectors include: 

- vectors for expression in mammalian cells: pMAMneo (Clontech), pcDNA3
  (Invitrogen), pMC1neo (Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC
  37593), pBPV-1 (8-2) (ATCC 37110), pdBPV-MMTneo (342-12) (ATCC 37224),
  pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146),
  pUCTag (ATCC 37460) and λZD35 (ATCC 37565);
- vectors for expression in bacterial cells: pET vectors (Novagen) and pQE vectors
  (Qiagen);
- vectors for expression in yeast or other fungal cells: pYES2 (Invitrogen) and Pichia
  expression vectors (Invitrogen);
- vectors for expression in insect cells: pBlueBacII (Invitrogen).

The nucleotide sequences and/or genetic constructs used in the invention may be 
used to transform a host cell or host organism.

The host cell may be any suitable (fungal, prokaryotic or eukaryotic) cell or cell 
line, for example: 

- a bacterial strain, including but not limited to strains of E. coli, Bacillus,
  Streptomyces and Pseudomonas;
- a fungal cell, including but not limited to cells from species of Aspergillus and
  Trichoderma;
- a yeast cell, including but not limited to cells from species of Kluyveromyces or
  Saccharomyces;
- an amphibian cell or cell line, such as Xenopus oocytes.

In one specific embodiment, which may be particularly useful when the nucleotide
sequences used in the invention are (to be) used in the discovery and development of
insecticidal compounds, the host cell may be an insect-derived cell or cell line, such as:

- cells/cell lines derived from Lepidoptera, including but not limited to Spodoptera Sf9
  and Sf21 cells;
- cells/cell lines derived from Drosophila, such as Schneider and Kc cells; and/or
- cells/cell lines derived from a pest species of interest (as mentioned below), such as
  from Heliothis virescens.

In one preferred embodiment, the host cell is a mammalian cell or cell line, for 
example derived from the mammals referred to above.

In an even more preferred aspect, the host cell is a cell or cell line derived from a
human or from another mammal, including but not limited to CHO, BHK and COS cells
and human cells or cell lines such as HeLa.

In one specific, but non-limiting embodiment, the cell or cell line may be a human
cell or cell line which is related to metabolic processes or metabolic disease and/or used
as a cellular model for metabolic disease, including but not limited to liver cells or cell
lines, adipocytes or muscle cells or cell lines such as HepG2 cells, 3T3-L1 adipocytes,
C2C12 cells and L6 myotubes.

The host organism may be any suitable multicellular (vertebrate or invertebrate)
organism, including but not limited to:

- a nematode, including but not limited to nematodes from the genus Caenorhabditis,
  such as C. elegans;
- an insect, including but not limited to species of Drosophila and/or a specific pest
  species of interest (such as those mentioned above);
- other well known model organisms, such as zebrafish;
- a mammal such as a rat or mouse.

Other suitable host cells or host organisms will be clear to the skilled person, for
example from the handbooks and patent applications mentioned above.

It should be noted that when a nucleotide sequence used in the invention is
expressed in a multicellular organism, it may be expressed throughout the entire 
organism, or only in one or more specific cells, tissues, organs and/or parts thereof, for 
example by expression under the control of a promoter that is specific for said cell(s), 
tissue(s), organ(s) or part(s). 

The nucleotide sequence may also be expressed during only a specific stage of 
development or life cycle of the host cell or host organism, again for example by 
expression under the control of a promoter that is specific for said stage of development 
or life cycle. Also, as already mentioned above, said expression may be constitutive, 
transient and/or inducible. 

According to one specific embodiment, the expression of a nucleotide sequence
used in the invention in a host cell or host organism may be partly or totally reduced (i.e.
knocked down or knocked out), compared to the original (e.g. native) host cell or host
organism. This may for instance be achieved in a transient manner using antisense and/or
RNA-interference techniques well known in the art, or in a constitutive manner using
random, site specific and/or chemical mutagenesis of the nucleotide sequence used in the
invention, or any other suitable techniques for generating "knock-down" or "knock-out"
animals.

Suitable transformation techniques will be clear to the skilled person and may 
depend on the intended host cell/host organism and the genetic construct to be used. 
Some preferred, but non-limiting examples of suitable techniques include ballistic 
transformation, (micro-)injection, transfection (e.g. using suitable transposons),
electroporation and lipofection. For these and other suitable techniques, reference is again 
made to the handbooks and patent applications mentioned above. 

After transformation, a step for detecting and selecting those host cells or host
organisms that have been successfully transformed with the nucleotide sequence/genetic
construct used in the invention may be performed. This may for instance be a selection
step based on a selectable marker present in the genetic construct used in the invention or
a step involving the detection of the amino acid sequence involved in the elongation of
fatty acids, e.g. using specific antibodies.

The transformed host cell (which may be in the form of a stable cell line) or host
organisms (which may be in the form of a stable mutant line or strain) form further
aspects of the present invention.

Preferably, these host cells or host organisms are such that they express, or are (at
least) capable of expressing (e.g. under suitable conditions), an amino acid sequence
involved in the elongation of fatty acids (and in case of a host organism: in at least one
cell, part, tissue or organ thereof). The invention also includes further generations,
progeny and/or offspring of the host cell or host organism used in the invention, that may
for instance be obtained by cell division or by sexual or asexual reproduction.

To produce/obtain expression of the amino acid sequences used in the invention, 
the transformed host cell or transformed host organism may generally be kept, 
maintained and/or cultured under conditions such that the (desired) amino acid sequence 
involved in the elongation of fatty acids is expressed/produced. Suitable conditions will 
be clear to the skilled person and will usually depend upon the host cell/host organism 
used, as well as on the regulatory elements that control the expression of the (relevant) 
nucleotide sequence used in the invention. Again, reference is made to the handbooks and
patent applications mentioned above in the paragraphs on the genetic constructs used in 
the invention. 

Generally, suitable conditions may include the use of a suitable medium, the 
presence of a suitable source of food and/or suitable nutrients, the use of a suitable 
temperature, and optionally the presence of a suitable inducing factor or compound (e.g.
when the nucleotide sequences used in the invention are under the control of an inducible
promoter); all of which may be selected by the skilled person. Again, under such
conditions, the amino acid sequences involved in the elongation of fatty acids may be 
expressed in a constitutive manner, in a transient manner, or only when suitably induced. 

It will also be clear to the skilled person that the amino acid sequence involved in 
the elongation of fatty acids may (first) be generated in an immature form (as mentioned 
above), which may then be subjected to post-translational modification, depending on the 
host cell/host organism used. Also, the amino acid sequence involved in the elongation of
fatty acids may be glycosylated, again depending on the host cell/host organism used.

The amino acid sequence involved in the elongation of fatty acids may then be
isolated from the host cell/host organism and/or from the medium in which said host cell
or host organism was cultivated, using protein isolation and/or purification techniques
known per se, such as (preparative) chromatography and/or electrophoresis techniques,
differential precipitation techniques, affinity techniques (e.g. using a specific, cleavable
amino acid sequence fused with the amino acid sequence involved in the elongation of
fatty acids) and/or preparative immunological techniques (i.e. using antibodies against the
amino acid sequence to be isolated).

In one embodiment, the amino acid sequence thus obtained may also be used to 
generate antibodies specifically against said sequence or an antigenic part or epitope
thereof.

Such antibodies, which form a further aspect of the invention, may be generated
in a manner known per se, for example as described in GB-A-2 357 768, US-A-5,693,492,
WO 95/32734, WO 96/23882, WO 98/02456, WO 98/41633 and/or WO 98/49306,
and/or as described in the prior art referred to above. Often, but not exclusively,
such methods will involve immunizing an immunocompetent host with the pertinent amino
acid sequence involved in the elongation of fatty acids or an immunogenic part thereof
(such as a specific epitope), in amount(s) and according to a regimen such that antibodies
against said amino acid sequence are raised, and then harvesting the antibodies thus
generated, e.g. from blood or serum derived from said host.

For instance, polyclonal antibodies can be obtained by immunizing a suitable host
such as a goat, rabbit, sheep, rat, pig or mouse with (an epitope of) an amino acid sequence
involved in the elongation of fatty acids, optionally with the use of an immunogenic carrier
(such as bovine serum albumin or keyhole limpet hemocyanin) and/or an adjuvant such as
Freund's, saponin, ISCOM's, aluminium hydroxide or a similar mineral gel, or keyhole
limpet hemocyanin or a similar surface active substance. After a suitable immune response
has been raised (usually within 1-7 days), the antibodies can be isolated from blood or serum
taken from the immunized animal in a manner known per se, which optionally may involve
a step of screening for an antibody with desired properties (i.e. specificity) using known
immunoassay techniques, for which reference is again made to for instance WO 96/23882.

Monoclonal antibodies may for example be produced using continuous cell lines in
culture, including hybridoma-based and similar techniques, again essentially as described in
the above cited references. Accordingly, cells and cell lines that produce monoclonal
antibodies against an amino acid sequence involved in the elongation of fatty acids form a
further aspect of the invention, as do methods for producing antibodies against amino acid
sequences involved in the elongation of fatty acids, which methods may generally involve
cultivating such a cell and isolating the antibodies from the culture (medium), again using
techniques known per se.

Also, Fab-fragments against the amino acid sequences involved in the elongation
of fatty acids (such as F(ab')2, Fab' and Fab fragments) may be obtained by digestion of an
antibody with pepsin or another protease, reducing disulfide-linkages and treatment with
papain and a reducing agent, respectively. Fab-expression libraries may for instance be
obtained by the method of Huse et al., 1989, Science 245:1275-1281.

In another embodiment, the nucleotide sequences used in the invention, the amino
acid sequences used in the invention, and/or a host cell or host organism that expresses
such an amino acid sequence, may also be used in an assay or assay method generally
(including but not limited to diagnostic assays and/or assays to determine the presence
and/or absence of specific mutations and/or genetic markers, for example to determine
susceptibility for a condition or disease associated with such a mutation or marker), and
in particular in an assay to identify and/or (further) develop compounds and/or other
factors that can modulate the (biological) activity of, and/or that can otherwise interact
with, the amino acid sequences involved in the elongation of fatty acids, and such uses
form further aspects of the invention. As will be clear to the skilled person, in this
context, the amino acid sequence involved in the elongation of fatty acids will serve as a
target for interaction with such a compound or factor.

In this context, the terms "modulate", "modulation", "modulator" and "target" will
have their usual meaning in the art, for which reference is inter alia made to the
definitions given in WO 98/06737. Generally, a modulator is a compound or factor that
can enhance, inhibit/reduce or otherwise alter, influence or affect (collectively referred to
as "modulation") a functional property of a biological activity or process (for example,
the biological activity of an amino acid sequence involved in the elongation of fatty
acids).

In this context, the amino acid sequence involved in the elongation of fatty acids
may serve as a target for modulation in vitro (e.g. as part of an assay or screen) and/or for
modulation in vivo (e.g. for modulation by a compound or factor that is known to
modulate the target, which compound or factor may for example be used as an active
compound for agrochemical, veterinary and/or pharmaceutical use).

For example, the amino acid sequences, host cells and/or host organisms used in
the invention may be used as part of an assay or screen that may be used to identify
and/or develop modulators of the amino acid sequence involved in the elongation of fatty
acids, such as a primary screen (e.g. a screen used to identify modulators of the target
from a set or library of test chemicals with unknown activity with respect to the target)
and/or a secondary assay (e.g. an assay used for validating hits from a primary screen
and/or used in optimizing hit molecules, e.g. as part of hits-to-leads chemistry).

For instance, such an assay or screen may be configured as an in vitro assay or
screen, which will generally involve binding of the compound or factor to be tested as a
potential modulator for the target (hereinbelow also referred to as "test chemical") to the
target, upon which a signal generated by said binding is measured. Suitable techniques
for such in vitro screening will be clear to the skilled person, and are for example
described in Eldefrawi et al. (1987), FASEB J., vol. 1, pages 262-271 and Rauh et al.
(1990), Trends in Pharmacol. Sci., vol. 11, pages 325-329. For example, such an assay or
screen may be configured as a binding assay or screen, in which the test chemical is used
to displace a detectable ligand from the target (e.g. a radioactive or fluorescent ligand),
upon which the amount of ligand displaced from the target by the modulator is
determined. Other suitable assays for the amino acid sequences involved in the
elongation of fatty acids will be clear to the skilled person, and may for example be found
in the prior art cited herein; such assays may optionally also be adapted to and/or
configured for screening in an automated, medium-to-high throughput fashion.
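
By way of illustration only, the read-out of a displacement binding assay of the kind just described could be reduced to a percentage of displaced ligand as in the following Python sketch. The two control signals (total binding without test chemical, and non-specific binding in the presence of an excess of unlabeled ligand) and the function name are assumptions of this sketch and are not prescribed by the methods described herein.

```python
def percent_displacement(signal_with_test: float,
                         total_binding: float,
                         nonspecific_binding: float) -> float:
    """Percentage of specifically bound labeled ligand displaced by a test chemical.

    total_binding:       signal of labeled ligand bound without test chemical
    nonspecific_binding: signal remaining with an excess of unlabeled ligand
    signal_with_test:    signal measured in the presence of the test chemical
    All three controls are hypothetical inputs chosen for this sketch.
    """
    specific_window = total_binding - nonspecific_binding
    if specific_window <= 0:
        raise ValueError("total binding must exceed non-specific binding")
    return 100.0 * (total_binding - signal_with_test) / specific_window


if __name__ == "__main__":
    # Hypothetical counts: half of the specific signal is lost.
    print(percent_displacement(6000.0, 10000.0, 2000.0))
```

A value near 0% would indicate no displacement, and a value near 100% would indicate that essentially all specifically bound ligand has been displaced.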

For example, reductases involved in the elongation of fatty acids such as KAR and
TER may be screened for modulators (such as inhibitors) using one of the following
techniques:

1) Radioactive detection method based on the use of a radioactively labeled substrate, for
example 14C-labeled malonyl-CoA (see also Moon and Horton, supra). To achieve
the separation between the reactants and products that facilitates quantification (in
particular when the assay is performed in a high throughput mode, compared to the
assay of Moon and Horton), the fatty acid chain (e.g. palmitoyl-CoA) could be bound
to streptavidin (directly or via biotin) instead of BSA (as used by Moon and Horton),
which in turn would be coated on the bottom of the microtitre well plate. Upon the
completion and stopping of the enzymatic reaction, the plates would be subjected to a
washing protocol prior to the scintillation measurement. The quantity of the
incorporated 14C would be proportional to the quantity of the 14C malonyl-CoA
condensed to the palmitoyl-CoA.

Alternatively, an SPA™ assay can be envisaged in which the palmitoyl-CoA is
attached to an SPA bead. Upon the reaction with 14C-labeled malonyl-CoA, the 14C
would be incorporated into the fatty acid chain fixed to the SPA bead, generating
scintillation from the bead due to the proximity of the radioactive label (Amersham
patented technology). The advantage of this assay configuration would be that no
washing is required.

2) Luminescent assay based on the detection of FMN cofactor (see for example Wame,
et al., 2002, Poster #2448, 8th Annual Conference and Exhibition of the Society for
Biomolecular Screening, September 22-26th 2002, The Hague, the Netherlands),
which is suitable for the detection of the activity of all three enzymes involved in
the fatty acid elongation. This may be performed either by using the isolated enzyme,
or a suitable microsomal preparation containing two or more, or essentially all,
enzymes of the elongation pathway.

3) Colorimetric detection of NADH consumption. In the assay described by Moon and
Horton, supra, NADH is used as a source of hydrogen in the reduction of the
3-ketoacyl-CoA. A colorimetric method has been described recently to detect directly
the consumption of NADH and formation of NAD (see Mayer and Arnold, 2002, J.
Biomol. Screening, Vol. 7 (2), pp. 135). In this way, the reduction reaction catalyzed by
KAR can be monitored by measuring the consumption of NADH.

4) Fluorescence Resonance Energy Transfer (FRET) can be detected when two suitable
fluorescent molecules are brought in proximity of each other. As the enzymatic
reaction performed by KAR is a reduction, it should in principle be possible to detect
product formation by FRET between the individually labeled reactants, provided that
the fluorophores are chosen such (especially with respect to the size of malonyl-CoA)
that steric hindrance can be avoided;

and/or by one of the assays described in the prior art for KAR and TER, such as the prior
art mentioned herein, which assay may optionally be adapted and/or configured for
automation and/or medium to high throughput.

The condensation enzymes involved in the First Step above could also be
screened using a luminescent assay based on the detection of FMN cofactor, as described
under 2) above; and/or by one of the assays described in the prior art for said
condensation enzymes, such as the prior art mentioned herein, which assay may
optionally be adapted and/or configured for automation and/or medium to high
throughput.

Assays or screens for identifying compounds that can interact with the amino acid
sequences involved in the elongation of fatty acids may also be configured as a cell-based
assay or screen, in which a host cell used in the invention is contacted with/exposed to a
test chemical, upon which at least one biological response by the host cell is measured.
For example, such cell-based assays may be performed by (over)expression of cDNA
encoding the elongase(s) to be screened in S. cerevisiae (as performed for the human
elongase HELO1 (= ELOVL5) by Leonard et al., supra) or in a mutant of S. cerevisiae
that lacks one or more of the native elongation enzymes ELO1, ELO2 and ELO3 (see
Toke et al. and Oh et al., both supra). The influence of the compound on the elongase(s)
may then be determined by exposing the cells to a suitable substrate and the compound(s)
to be tested, analogous to the methods described in the references just cited.

Such cell-based methods may also advantageously allow screening of the influence
of the compound(s) to be tested not only on the elongase(s) to be screened, but also on
the whole elongation pathway or cell, and may as such form a suitable secondary assay
for confirmation of hits generated using an in vitro HTS screen as described above.

Alternatively, for identifying compounds that influence the entire elongation
pathway (i.e. two or more enzymes therein), the luminescent assay based on the detection
of FMN cofactor described under 2) above could be used, using a suitable microsomal
preparation containing all enzymes from the elongation pathway.

Suitable cells or cell lines for such cell based assays include those mentioned
above. In one preferred, but non-limiting embodiment, the cell or cell line may be a
mammalian, and in particular human, cell or cell line which is related to metabolic
processes or metabolic disease and/or used as a cellular model for metabolic disease,
including but not limited to liver cells or cell lines, adipocytes or muscle cells or cell lines
such as HepG2 cells, 3T3-L1 adipocytes, C2C12 cells and L6 myotubes.

Also, such an assay or screen may be configured as a whole-animal screen,
in which a host organism used in the invention is contacted with/exposed to a test
chemical, upon which at least one biological response (such as a phenotypical,
behavioural and/or physiological change, including but not limited to paralysis or death) by
the host organism is measured. Such screens may be carried out in any model organism
known per se, including but not limited to yeast, Drosophila, zebrafish or C. elegans.

Thus, generally, the assays and screens described above will comprise at least one
step in which the test chemical is contacted with the target (and/or with a host cell or host
organism that expresses the target), and in particular in such a way that a signal is
generated that is representative for the modulation of the target by the test chemical. In a
further step, said signal may then be detected.

The compounds that may be tested using the methods of the invention are 
generally described below. 

The assays and screens used in the invention may be carried out at medium
throughput to high throughput, for example in an automated fashion using suitable
robotics. In particular, in this embodiment, the method used in the invention may be
carried out by contacting the target with the test compound in a well of a multi-well plate,
such as a standard 24, 96, 384, 1536 or 3456 well plate.
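
When such plates are handled in an automated fashion, each test compound has to be assigned to a well. The following Python sketch illustrates one possible way of mapping a running compound index to a well name for the plate sizes mentioned above. The row and column counts reflect the commonly used 2:3 plate geometries, and the row-major filling order, the function names and the double-letter row labels are assumptions of this sketch.

```python
# Commonly used (rows, columns) layouts for the plate sizes named in the text.
PLATE_LAYOUTS = {
    24: (4, 6),
    96: (8, 12),
    384: (16, 24),
    1536: (32, 48),
    3456: (48, 72),
}


def row_label(row_index: int) -> str:
    """A..Z for the first 26 rows, then AA, AB, ... (assumed convention)."""
    if row_index < 26:
        return chr(ord("A") + row_index)
    return "A" + chr(ord("A") + row_index - 26)


def well_name(plate_size: int, index: int) -> str:
    """Map a zero-based compound index to a well name, filling row by row."""
    rows, cols = PLATE_LAYOUTS[plate_size]
    if not 0 <= index < rows * cols:
        raise ValueError("index does not fit on this plate")
    row, col = divmod(index, cols)
    return f"{row_label(row)}{col + 1}"


if __name__ == "__main__":
    print(well_name(96, 0), well_name(96, 95), well_name(384, 24))
```

In practice some wells would typically be reserved for controls; this is left out of the sketch for brevity.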

Usually, in a screen or assay used in the invention, for each measurement, the
target or host cell/host organism will be contacted with only a single test compound.
However, it is also within the scope of the invention to contact the target with two or
more test compounds - either simultaneously or sequentially - for example to determine
whether said combination provides a synergistic effect.

According to one specific, but non-limiting embodiment of the invention, when
the compound is intended to interact with a condensation enzyme involved in the First
Step above, the compound used is also preferably such that it is selective for one
condensation enzyme (i.e. with a specific substrate specificity, as described above)
compared to another condensation enzyme (i.e. with a different substrate specificity).

In particular, the compound used is preferably such that it is selective for the
condensation enzymes which are indicated hereinabove as preferred, compared to
condensation enzymes that are indicated hereinabove as being less preferred. For example,
the compound used is preferably selective for a condensation enzyme that has specificity
(as defined above) for fatty acids with a length of the carbon chain of 20 carbon atoms or
less, compared to a condensation enzyme that has specificity for fatty acids with a length
of the carbon chain of more than 20 carbon atoms.

In particular, the compound used is preferably such that it has specificity for LCE
and/or ELOVL6, compared to other condensation enzymes.

The selectivity of a compound for one enzyme/isoform compared to another may
be determined in a manner known per se, which generally involves comparing the
influence the compound has on the activity of one enzyme/isoform compared to another,
using a suitable assay (usually at the same concentration(s) of compound, enzyme and
substrate). For example, when the compound is an inhibitor, the selectivity may be
established by comparing the IC50 or ED50 values of the compound with respect to each
enzyme, as is well known in the art, with a lower ED50 or IC50 value with respect to a
first enzyme compared to a second enzyme being an indication that the compound is
more selective for the first enzyme compared to the second.

In particular, in the invention, a compound is considered selective for a first
enzyme compared to a second when the ED50 or IC50 value of the compound with
respect to the first enzyme is less than 90%, preferably less than 50%, more preferably
less than 25%, even more preferably less than 10%, and in particular less than 5%, such as
less than 1% or even less than 0.1%, of the ED50 or IC50 value for the second enzyme.
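
The selectivity criterion above amounts to a ratio of two potency values. A minimal Python sketch follows, assuming that the IC50 (or ED50) values for both enzymes were measured in the same units under comparable assay conditions; the function names and the tier labels are illustrative assumptions only.

```python
# Thresholds mirror the percentages listed in the text, strictest first.
SELECTIVITY_TIERS = [
    (0.001, "less than 0.1%"),
    (0.01, "less than 1%"),
    (0.05, "less than 5%"),
    (0.10, "less than 10%"),
    (0.25, "less than 25%"),
    (0.50, "less than 50%"),
    (0.90, "less than 90%"),
]


def selectivity_ratio(ic50_first, ic50_second):
    """Return IC50(first enzyme) / IC50(second enzyme)."""
    if ic50_first <= 0 or ic50_second <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_first / ic50_second


def selectivity_tier(ic50_first, ic50_second):
    """Return the strictest tier met, or None if the compound is not
    selective for the first enzyme under the 90% criterion."""
    ratio = selectivity_ratio(ic50_first, ic50_second)
    for threshold, label in SELECTIVITY_TIERS:
        if ratio < threshold:
            return label
    return None
```

With hypothetical values of 2 and 100 (same units) for the first and second enzyme, the ratio is 0.02 and the compound would fall in the "less than 5%" tier.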

Once a test chemical has been identified as a modulator for an amino acid
sequence involved in the elongation of fatty acids (e.g. by means of a screen or assay as
described hereinabove), it may be used per se as a modulator of the relevant amino
acid sequence involved in the elongation of fatty acids (e.g. as an active substance for
pharmaceutical use), or it may optionally be further optimized for final use, e.g. to
improve properties such as solubility, ADME-TOX and other desired properties. It will
be clear to the skilled person that the nucleotide sequences, amino acid sequences, host
cells/host organisms and/or methods used in the invention may find further use in such
optimization methodology, for example as (part of) secondary assays.

The invention is not particularly limited to any specific manner or mechanism
in/via which the modulator (e.g. the test chemical, compound and/or factor) modulates, or
interacts with, the target (in vivo and/or in vitro). For example, the modulator may be an
agonist, an antagonist, an inverse agonist, a partial agonist, a competitive inhibitor, a
non-competitive inhibitor, a cofactor, an allosteric inhibitor or other allosteric factor for the
target, and/or may be a compound or factor that enhances or reduces binding of the target to
another biological component associated with its (biological) activity, such as another
protein or polypeptide, a receptor, or a part or organelle of a cell. As such, the modulator
may bind with the target (at the active site, at an allosteric site, at a binding domain
and/or at another site on the target, e.g. covalently or via hydrogen bonding), block the
active site of the target (in a reversible, irreversible or competitive manner), block a
binding domain of the target (in a reversible, irreversible or competitive manner), and/or
influence or change the conformation of the target.

As such, the test chemical/modulator may for instance be:

- an analog of a known substrate of the target;
- an oligopeptide, e.g. comprising between 2 and 20, preferably between 3 and 15
  amino acid residues;
- an antisense or double stranded RNA molecule;
- a protein or polypeptide;
- a cofactor or an analog of a cofactor.

Preferably, the compound is an inhibitor of the target, although the invention in its
broadest sense is not limited thereto.

The test chemical/modulator may also be a reference compound or factor, which
may be a compound that is known to modulate or otherwise interact with the target (e.g. a
known substrate or inhibitor for the target) or a compound or factor that is generally
known to modulate or otherwise interact with other members
from the general class to which the target belongs (e.g. a known substrate or inhibitor of
said class).

Preferably, however, the compound(s) will be "small molecules", by which is
generally meant herein a molecular entity with a molecular weight of less than 1500,
preferably less than 1000. This may for example be an organic, inorganic or
organometallic molecule, which may also be in the form of a suitable salt, such as a
water-soluble salt; and may also be a complex, chelate and/or a similar molecular entity,
as long as its (overall) molecular weight is within the range indicated above.

In a preferred embodiment, such a "small molecule" has been designed according to,
and/or meets the criteria of, at least one, preferably at least any two, more preferably at
least any three, and up to all of the so-called Lipinski rules for drug likeness prediction
(vide Lipinski et al., Advanced Drug Delivery Reviews 23 (1997), pages 3-25). As is
known in the art, small molecules which meet these criteria are particularly suited (as
starting points) for the design and/or development of pharmaceuticals for human use, and
may for instance be used as starting points for hits-to-leads chemistry, and/or as starting
points for lead development (in which the methods used in the invention may also be
applied).
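
The graded preference above (at least one, two, three or all of the rules) can be expressed as a simple count. The Python sketch below assumes the four commonly cited Lipinski thresholds (molecular weight of at most 500, logP of at most 5, at most 5 hydrogen bond donors, at most 10 hydrogen bond acceptors), which are not restated in this text, and assumes that the descriptors have been computed beforehand by other means; the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class MoleculeProperties:
    """Pre-computed descriptors; how they are obtained is outside this sketch."""
    molecular_weight: float  # in daltons
    log_p: float             # octanol-water partition coefficient
    h_bond_donors: int
    h_bond_acceptors: int


def lipinski_rules_met(m):
    """Count how many of the four commonly cited Lipinski criteria are met."""
    checks = [
        m.molecular_weight <= 500,
        m.log_p <= 5,
        m.h_bond_donors <= 5,
        m.h_bond_acceptors <= 10,
    ]
    return sum(checks)
```

A returned count of 1, 2, 3 or 4 would then correspond to the successive levels of preference described above.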

Also, for these purposes, the design of such small molecules (as well as the design 
of libraries consisting of such small molecules) will preferably also take into account the 
presence of pharmacophore points, for example according to the methods described by I. 
Muegge et al., J. Med. Chem. 44, 12 (2001), pages 1-6 and the documents cited therein.

The term "small peptide" generally covers (oligo)peptides that contain a total of
between 2 and 35, such as for example between 3 and 25, amino acids (e.g. in one or
more connected chains, and preferably a single chain). It will be clear that some of these
small peptides will also be included in the term small molecule as used herein, depending
on their molecular weight.

In one preferred, but non-limiting embodiment, the invention is used to screen a
set or library of (related or otherwise unrelated) small molecules, for example a standard
"robustness set", a primary screening library (e.g. of otherwise unrelated compounds), a
combinatorial library, or a series of closely related chemical analogs. Such sets or libraries
will be clear to the skilled person, and may for instance include, but are not limited to,
commercially available chemical libraries such as the various libraries available
from Tocris Cookson, Bristol, UK.

The modulators thus identified by the methods used in the invention can be used 
in (the preparation of a pharmaceutical composition for) the prevention and/or treatment 
of metabolic diseases, and/or can be used to develop other compounds that can be used in 
(the preparation of a pharmaceutical composition for) the prevention and/or treatment of 
metabolic diseases, i.e. as already outlined above.

Also, as already mentioned above, the use of the human amino acid sequences of
SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and 33, and/or of sequences derived
therefrom (such as analogs, parts, fragments, and/or fusions thereof as described
hereinabove), and of host cells/host organisms containing/expressing these, is usually
preferred, in particular when the invention is used to develop compounds for
pharmaceutical use.

As already mentioned above, the compounds and/or factors that have been
identified and/or developed as modulators of the amino acid sequences involved in the
elongation of fatty acids (and/or precursors for such compounds) may be useful as active
substances in the pharmaceutical field, for example in the preparation of pharmaceutical
compositions, and both such modulators as well as (pharmaceutical) compositions
containing them form further aspects of the invention.

In particular, the compounds and compositions of the invention may be used in
(the preparation of pharmaceutical compositions for) the prevention (e.g. prophylaxis)
and/or treatment of metabolic diseases (which for the purposes herein in its broadest
sense also includes preventing, treating and/or alleviating the symptoms and/or
complications of such metabolic diseases).

In particular, such compounds and compositions may be used in (the preparation of
pharmaceutical compositions for) the prevention (e.g. prophylaxis) and/or treatment of
metabolic diseases (which for the purposes herein in its broadest sense also includes
preventing, treating and/or alleviating the symptoms and/or complications of such
metabolic diseases).

In particular, the compounds and compositions of the invention may be used for
preventing and/or treating:

- hyperglycemic conditions and/or other conditions and/or diseases that are (primarily)
  associated with (the response or sensitivity to) insulin, including but not limited to all
  forms of diabetes and disorders resulting from insulin resistance, such as Type I and
  Type II diabetes, as well as severe insulin resistance, hyperinsulinemia, and
  hyperlipidemia, e.g., obese subjects, and insulin-resistant diabetes, such as
  Mendenhall's Syndrome, Werner Syndrome, leprechaunism, lipoatrophic diabetes,
  and other lipoatrophies;
- conditions caused by or usually associated with hyperglycemic conditions and/or
  obesity, such as hypertension, osteoporosis and/or lipodystrophy;
- so-called "metabolic syndrome" (also known as "Syndrome X"), which is a condition
  where several of the following conditions coexist: hypertension; insulin resistance;
  diabetes; dyslipidemia; and/or obesity.

In particular, the compounds and compositions of the invention may be used for
preventing and/or treating diabetes, especially Type I and Type II diabetes. "Diabetes"
itself refers to a progressive disease of carbohydrate metabolism involving inadequate
production or utilization of insulin and is characterized by hyperglycemia and glycosuria.

Also, as mentioned above, the amino acid sequences involved in the elongation of
fatty acids and in particular the nucleotide sequences used in the invention, and more in
particular the human amino acid sequences and nucleotide sequences used in the
invention, may be used for diagnostic purposes, for example as part of diagnostic assays
and/or as part of kits for performing such assays (in which such a kit will comprise at
least a nucleotide sequence used in the invention, may be suitably packaged (e.g. in a
suitable container) and may optionally further comprise one or more elements for such
kits known per se, such as suitable reagents, buffers or other solvents, and instructions for
use).

In particular, the amino acid sequences and nucleotide sequences used in the
invention, as well as assays and kits using such sequences, may be used for diagnostic
purposes relating to one or more of the metabolic diseases indicated above, for example
as assays to determine the presence and/or absence in an individual of specific mutations
and/or genetic markers that relate to one or more of the metabolic diseases referred to
above, to determine the susceptibility and/or any predisposition for any of the metabolic
diseases referred to above in an individual, to determine if any genetically determined
factors contribute to or even cause (in full or in part) a metabolic disease in an individual,
to determine and/or to confirm the kind of metabolic disease from which an individual
suffers, and/or to predict the further progress of a metabolic disease in an individual. It
will also be clear that any results obtained using such a diagnostic method or assay may
also provide guidance to the clinician as to how a metabolic disease should be treated in
an individual, e.g. which diet should be followed and/or which medication should be
prescribed and/or the dosage regimen to be used.

It should also be noted that, for the treatment of the metabolic disease in humans,
the compound used will usually and preferably be an inhibitor of an amino acid sequence
involved in the elongation of fatty acids, although the invention in its broadest sense is
not limited thereto.

In one specific, but non-limiting, embodiment of the invention, a compound is 
considered an inhibitor of one of the amino acid sequences involved in the elongation of 
fatty acids if, in a relevant assay such as the kinase activity assays referred to above (or a 
suitable modification thereof, for example using partially or fully purified protein), said
compound reduces the activity of said amino acid sequence, i.e. by at least 1%, preferably 
at least 10%, such as by 20% or more, compared to the activity without the presence of 
said compound. 

In an even more specific, but non-limiting, embodiment of the invention, a
compound is considered an inhibitor of one of the amino acid sequences involved in the
elongation of fatty acids if, in a relevant assay, such as a binding assay, said compound
has an IC50 value of less than 1000 µM, preferably less than 500 µM, more preferably less
than 250 µM, even more preferably less than 100 µM, for example 50 µM or less, such as
about 10 µM or less.
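
The two inhibitor criteria given above (a minimum reduction of activity, and a series of IC50 bounds) can be sketched as follows. This is an illustrative Python sketch only: it assumes that activities are measured in the same arbitrary units with and without the compound, and that IC50 values are expressed in µM; the function names and the notion of a numbered "preference level" are hypothetical.

```python
def percent_inhibition(activity_with, activity_without):
    """Reduction in activity, as a percentage of the activity without compound."""
    if activity_without <= 0:
        raise ValueError("reference activity must be positive")
    return 100.0 * (activity_without - activity_with) / activity_without


def meets_activity_criterion(activity_with, activity_without, min_reduction_pct=1.0):
    """True if activity is reduced by at least min_reduction_pct
    (1% in the text, preferably 10%, such as 20% or more)."""
    return percent_inhibition(activity_with, activity_without) >= min_reduction_pct


# (bound in µM, inclusive?) from the broadest to the most preferred criterion.
IC50_BOUNDS_UM = [
    (1000.0, False), (500.0, False), (250.0, False),
    (100.0, False), (50.0, True), (10.0, True),
]


def ic50_preference_level(ic50_um):
    """Number of successive IC50 bounds met (0 = criterion not met)."""
    level = 0
    for bound, inclusive in IC50_BOUNDS_UM:
        if ic50_um < bound or (inclusive and ic50_um == bound):
            level += 1
    return level
```

For a hypothetical compound with an IC50 of 300 µM, two of the bounds are met (less than 1000 µM and less than 500 µM).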

Again, preferably, in the invention compounds are used that are modulators, and
in particular inhibitors, of the human amino acid sequences of SEQ ID NO: 3, 5, 8, 11,
14, 17, 20, 23, 26 and 33, and/or of amino acid sequences derived therefrom, such as
analogs, mutants, parts, fragments and/or fusions as described above.

For pharmaceutical use, the compounds of the invention may be used as a free
acid or base, and/or in the form of a pharmaceutically acceptable acid-addition and/or
base-addition salt (e.g. obtained with a non-toxic organic or inorganic acid or base), in the
form of a hydrate, solvate and/or complex, and/or in the form of a pre-drug, such as an
ester. Such salts, hydrates, solvates, etc. and the preparation thereof will be clear to the
skilled person; reference is for instance made to the salts, hydrates, solvates, etc.
described in US-A-6,372,778, US-A-6,369,086 and US-6,369,067.

Generally, for pharmaceutical use, the compounds of the invention may be
formulated as a pharmaceutical preparation comprising at least one compound of the
invention and at least one pharmaceutically acceptable carrier, diluent or excipient and/or
adjuvant, and optionally one or more further pharmaceutically active compounds. By
means of non-limiting examples, such a formulation may be in a form suitable for oral
administration, for parenteral administration (such as by intravenous, intramuscular or
subcutaneous injection or intravenous infusion), for topical administration, for
administration by inhalation, by a skin patch, by an implant, by a suppository, etc. Such
suitable administration forms - which may be solid, semi-solid or liquid, depending on
the manner of administration - as well as methods and carriers for use in the preparation
thereof, will be clear to the skilled person; reference is again made to for instance
US-A-6,372,778, US-A-6,369,086 and US-6,369,067.

The pharmaceutical preparations of the invention are preferably in a unit dosage
form, and may be suitably packaged, for example in a box, blister, vial, bottle, sachet,
ampoule or in any other suitable holder or container (which may be properly labeled);
optionally with one or more leaflets containing product information and/or instructions
for use. Generally, such unit dosages will contain between 1 and 500 mg of the at least
one compound of the invention, e.g. about 10, 25, 50, 100, 200, 500 or 1000 mg per unit
dosage.

For pharmaceutical use, at least one compound of the invention will generally be
administered in an amount of between 0.01 and 150 mg/kg body weight of the
patient per day, divided over one or more daily doses. The amount(s) to be administered and the
further treatment regimen may be determined by the treating clinician, depending on
factors such as the age, gender and general condition of the patient and the nature and
severity of the disease/symptoms to be treated.
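
Purely as an arithmetic illustration of the dosage range given above, the following Python sketch converts a dose in mg/kg body weight per day into a total daily amount and an amount per administration. The function names are hypothetical, the even split over the daily doses is an assumption, and the actual regimen remains for the treating clinician to determine.

```python
# Range stated in the text, in mg per kg body weight per day.
MIN_MG_PER_KG = 0.01
MAX_MG_PER_KG = 150.0


def daily_dose_mg(body_weight_kg, dose_mg_per_kg):
    """Total daily amount in mg for a dose within the stated range."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if not MIN_MG_PER_KG <= dose_mg_per_kg <= MAX_MG_PER_KG:
        raise ValueError("dose outside the 0.01 to 150 mg/kg/day range")
    return body_weight_kg * dose_mg_per_kg


def per_administration_mg(body_weight_kg, dose_mg_per_kg, doses_per_day):
    """Amount per administration, assuming an even split over the day."""
    if doses_per_day < 1:
        raise ValueError("at least one daily dose is required")
    return daily_dose_mg(body_weight_kg, dose_mg_per_kg) / doses_per_day
```

For example, a hypothetical 70 kg patient at 1.5 mg/kg per day, divided over three doses, would receive 35 mg per administration.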

Thus, in a further aspect, the invention relates to a composition, and in particular a
composition for pharmaceutical use, that contains at least one compound of the invention
(i.e. a compound that has been identified, discovered and/or developed using a nematode
or method as described herein) and at least one suitable carrier (i.e. a carrier suitable for
pharmaceutical use). The invention also relates to the use of a compound of the invention
in the preparation of such a composition.

Preferably, the compounds and compositions of the invention are administered
orally and/or in a form intended and/or suitable for oral administration. 

It is also envisaged that the above compounds and compositions may be of value
in the veterinary field, which for the purposes herein not only includes the prevention
and/or treatment of diseases in animals, but also - for economically important animals
such as cattle, pigs, sheep, chicken, fish, etc. - enhancing the growth and/or weight of the
animal and/or the amount and/or the quality of the meat or other products obtained from
the animal. Thus, in a further aspect, the invention relates to a composition for veterinary
use that contains at least one compound of the invention (i.e. a compound that has been
identified, discovered and/or developed using a nematode or method as described herein)
and at least one suitable carrier (i.e. a carrier suitable for veterinary use). The invention
also relates to the use of a compound of the invention in the preparation of such a
composition.

In the agrochemical field, the invention may be used to identify compounds
suitable for use in pesticides, insecticides, nematicides and/or other biocides or plant
protection agents. For example, the compounds of the invention may be used to control the
species listed in US-A-6,372,774. For this purpose, the compounds of the invention (or a
suitable salt, hydrate or ester thereof) may be suitably formulated with one or more
agrochemically acceptable carriers, to provide a formulation suitable for agrochemical
use, as will be clear to the skilled person (reference is for example made to the
formulations and uses described in US-A-6,372,774).

Thus, in a further aspect, the invention relates to a composition for agrochemical
use that contains at least one compound of the invention (i.e. a compound that has been
identified, discovered and/or developed using a nematode or method as described herein)
and at least one suitable carrier (i.e. a carrier suitable for agrochemical use). The
invention also relates to the use of a compound of the invention in the preparation of such
a composition.

The invention will now be further illustrated by means of the following
non-limiting Experimental Part.

In the Figures:

- Figure 1 schematically shows vector pGN49A (see also WO 00/01846 and UK
  patent application no. GB 0012233, both by Applicant);
- Figures 2A and 2B are photographs (enhanced using the Scion Image (Scion Corp)
  software package) showing reduced fat-absorption phenotype in C. elegans upon
  Nile Red staining: Figure 2A = reduced fat storage (invention); Figure 2B =
  reference (reference vector gGN29 without RNAi insert).



Experimental part: 

In the Experimental Part below, unless indicated otherwise, all steps for handling
and cultivating C. elegans were performed using standard techniques and procedures, for
which reference is made to the standard C. elegans handbooks, such as W.B. Wood et al.,
"The nematode Caenorhabditis elegans", Cold Spring Harbor Laboratory Press (1988);
D.L. Riddle et al., "C. elegans II", Cold Spring Harbor Laboratory Press (1997);
"Caenorhabditis elegans: Modern Biological analysis of an organism", ed. by H. Epstein
and D. Shakes, Methods in Cell Biology, Vol 48, 1995; and "C. elegans: a practical
approach", ed. by I.A. Hope, Oxford University Press Inc. New York, USA, 1999.

Downregulation of the gene(s) of interest in C. elegans was achieved by RNAi
feeding techniques using an E. coli strain capable of expressing a dsRNA corresponding
to the gene(s) of interest, as generally described in - inter alia - the International
application WO 00/01846 by applicant and the handbooks referred to above.

Also, unless indicated otherwise, all cloning and other molecular biology steps
were performed using standard techniques and protocols, i.e. as provided by the
manufacturers of the reagents/kits used and/or as described in the standard handbooks,
such as Sambrook et al., "Molecular Cloning: A Laboratory Manual" (2nd ed.), Vols. 1-3,
Cold Spring Harbor Laboratory Press (1989) and F. Ausubel et al., eds., "Current
protocols in molecular biology", Greene Publishing and Wiley Interscience, New York
(1987).

Fat accumulation in C. elegans daf-2 (e1370) was determined visually under a
microscope upon staining with Nile Red, using an adaptation of the general methodology
described by Ogg et al., Nature, Vol. 389, 994 (1997). For the general methodology,
reference is also made to Thaden et al., 1999 International Worm Meeting abstract 837;
Ashrafi and Ruvkun, 2000 East Coast Worm Meeting abstract 67; Ashrafi, Chang and
Ruvkun, 2001 International Worm Meeting abstract 325; and Rottiers and Antebi, 2001
International Worm Meeting abstract 620 (all abstracts available from Worm Literature
Index at http://elegans.swmed.edu/wli/).

Example 1: Preparation of E. coli RNA feeding strain for expression of C56G2.6 double
stranded RNA.

A vector for expression of dsRNA for downregulation of C. elegans gene
C56G2.6 was prepared as follows.

The DNA fragment of SEQ ID NO: 30, which corresponds to 821 nucleotides of
the C. elegans C56G2.6 gene (SEQ ID NO: 27), was obtained by PCR from genomic C.
elegans DNA, using the following primers:

- forward primer: TGCCAGTGCT TCTTGGTTGG [SEQ ID NO: 28]
- reverse primer: AGCTCGAGTT GAAGTTGATG CG [SEQ ID NO: 29]

This fragment was inserted in the SrfI-site of expression vector pGN49A
(Figure 1, see also WO 00/01846 and UK patent application no. GB 0012233, both by
Applicant). This vector contains two T7 promoters flanking the SrfI-site, allowing
transcription of a nucleotide sequence inserted into said SrfI-site into double stranded
RNA, upon binding of a T7 polymerase to said promoters (vide WO 00/01846).
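
As a simple check on the primers given above, their length and GC content can be computed as in the following Python sketch. The sequences are copied from the text with the grouping spaces removed; the assignment of SEQ ID NO: 28 to the forward primer and SEQ ID NO: 29 to the reverse primer is assumed from the order of listing, and the helper names are hypothetical.

```python
FORWARD_PRIMER = "TGCCAGTGCTTCTTGGTTGG"    # assumed to be SEQ ID NO: 28
REVERSE_PRIMER = "AGCTCGAGTTGAAGTTGATGCG"  # assumed to be SEQ ID NO: 29

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G and T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


for name, primer in (("forward", FORWARD_PRIMER), ("reverse", REVERSE_PRIMER)):
    print(name, len(primer), "nt, GC fraction", round(gc_fraction(primer), 2))
```

The reverse complement helper may be used, for instance, to locate the reverse primer on the coding strand of a target sequence.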

The resulting vector, designated pGN49A-C56G2.6, was transformed overnight
into E. coli strain AB 309-105 (see EP-A-1 093 526 by applicant, page 17).

To normalize the culture, 250 µl of the overnight culture (1 ml) was transferred to
a 96 well plate and the OD at 600 nm was measured (Fluostar Galaxy plate reader, BMG);
the remaining 750 µl was centrifuged down. Next, the pellet was re-suspended in S-complete
fed (S-complete supplemented with 0.1 mg/ml ampicillin and 1 mM IPTG) and the volume
adjusted to obtain an OD600 value of 1.
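
The volume adjustment in the normalization step can be estimated as in the Python sketch below. It assumes that the optical density scales linearly with cell density and that the plate-reader value measured on the 250 µl aliquot is representative of the 750 µl that was pelleted; the function name and parameter names are hypothetical.

```python
def resuspension_volume_ul(measured_od600, pelleted_volume_ul=750.0, target_od600=1.0):
    """Volume (µl) in which to resuspend the pellet to reach target_od600.

    Assumes a linear relation between OD600 and cell density.
    """
    if measured_od600 <= 0 or target_od600 <= 0:
        raise ValueError("OD600 values must be positive")
    if pelleted_volume_ul <= 0:
        raise ValueError("pelleted volume must be positive")
    return measured_od600 * pelleted_volume_ul / target_od600
```

For a hypothetical overnight culture with a measured OD600 of 2.0, the pellet from 750 µl would be resuspended in 1500 µl of S-complete fed to obtain an OD600 of about 1.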



Example 2: Generation of fat storage phenotype in C. elegans - P0 screen for C. elegans
gene C56G2.6.

In this example, C. elegans strain CB1370 containing the temperature sensitive
daf-2 allele e1370 is used (Ogg et al., supra). CB1370 is publicly available from, for
example, the Caenorhabditis Genetics Center (CGC), Minnesota, USA.

To generate the fat-storage phenotype, L1 worms of strain CB1370 were
cultivated at a temperature of 15°C in S-Complete fed-medium in the wells of a 96 well
plate (40 L1 nematodes per well) under essentially synchronized conditions, until the
nematodes reached the L2 stage.

Then, the temperature was increased to 25°C, and the worms were further
cultivated at said temperature until they reached the L4 stage (about 36-48 hours). Due to
the presence of the daf-2 allele e1370, this rise in temperature from 15°C to 25°C
causes the nematodes to accumulate fat, mainly in their intestinal and hypodermal tissue
(vide Ogg et al. and Figures 2A and 2B).

The accumulation of fat (in the form of droplets) was made visible by means of
Nile Red staining: L4 animals were washed several times with M9 (supplemented with
0.1% PEG) to remove the remaining E. coli, and fixed with MeOH (final concentration 33%). After
fixation the nematodes were stained with Nile Red (final concentration 0.375 mM in 37.5% MeOH) for 4
hours. MeOH and excess dye were removed through several washes with M9
(supplemented with 0.1% PEG). The staining pattern was visualized under UV using a
500 nm long pass filter.

For testing the influence of the gene C56G2.6 on fat storage, during the steps
described above, the worms were grown on 20 µl of the normalized E. coli strain
containing the pGN49A vector with the RNAi fragment for C56G2.6 inserted therein, as
obtained in Example 1 (OD600 = 1), as a food source. As a reference, the daf-2 (e1370)
nematodes were grown in a similar manner, but with E. coli strain AB 309-105
containing vector pGN49A without the RNAi fragment for C56G2.6 inserted therein as a
food source, used in the same amount. All samples were run in quadruplicate.

The results were as follows: worms fed on E.coli pGN49A- C56G2.6 strain, 
which downregulates the expression of C56G2.6 through RNA interference, showed a 
strong reduction of the accumulation of fat, compared to the reference (vide Figures 2A
and 2B). 

These results show that C56G2.6 is involved in the regulation of (the daf-2
dependent) accumulation of fat in the nematode. From bio-informatic analysis, it was
found that Genbank protein 7705855/Genbank DNA HSD17B12 is the closest human
ortholog for C56G2.6. As already mentioned above, HSD17B12 (nucleotide sequence
NM_016142; amino acid sequence NP_057226.1) corresponds to KAR mentioned above.
This result confirms that enzymes that are involved in the elongation of fatty acids (as
described above) can be used as targets in metabolic disease.

Also, besides KAR, from bio-informatic analysis, a close human homolog of both
KAR and the C. elegans gene C56G2.6 was also established, the nucleotide sequence of
which is given in SEQ ID NO: 31 (mRNA sequence) and SEQ ID NO: 32 (mRNA -
coding sequence); the amino acid sequence is given in SEQ ID NO: 33. Reference is also
made to Genbank accession numbers NM_031463 (gi 22432036) for the nucleotide
sequence and NP_113651.3 (gi 22432037) for the amino acid sequence. Without being
limited to any specific hypothesis or explanation, it is assumed that this specific amino
acid sequence is also involved in the elongation of fatty acids, and possibly in the Second
Step referred to above. Therefore, the use of the sequences of SEQ ID NOS 31, 32 and in
particular 33 (as well as analogs thereof, as defined hereinabove) in the methods
described herein forms a further aspect of the invention.

CLAIMS:

1. Method for identifying a compound that can be used in the prevention and/or
treatment of metabolic diseases, said method at least comprising the steps of:

a) contacting an amino acid sequence that is involved in the elongation of fatty acids,
and/or a host cell or host organism containing/expressing such an amino acid
sequence, with a test chemical, in such a way that a signal may be generated that is
representative for the interaction between said test chemical and said amino acid
sequence; and optionally

b) detecting the signal that may thus be generated, said signal identifying a modulator of
said amino acid sequence;

in which the modulator thus identified can be used in (the preparation of a pharmaceutical
composition for) the prevention and/or treatment of metabolic diseases.

2. Method for generating a signal that is representative for the interaction of a
test chemical with an amino acid sequence involved in the elongation of fatty acids, said
method at least comprising the steps of:

a) contacting said amino acid sequence, or a host cell or host organism
containing/expressing said amino acid sequence involved in the elongation of fatty
acids, with said test chemical, in such a way that a signal may be generated that is
representative for the interaction between said test chemical and said amino acid
sequence; and optionally

b) detecting the signal that may thus be generated.

3. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA; and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA.

4. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity for fatty acids with a
length of the carbon chain of 20 carbon atoms or less; and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA.

5. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity for fatty acids with a
length of the carbon chain of 18 carbon atoms or less; and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA.

6. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity for saturated fatty acids
with 18 carbon atoms or less in the fatty acid chain, compared to both (1) saturated
and unsaturated fatty acids with more than 18 carbon atoms in the fatty acid chain; as
well as (2) unsaturated fatty acids of the (n-3) family and/or (n-6) family with 18
carbon atoms or less in the fatty acid chain; and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA.

7. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity for palmitic acid and/or
for stearic acid, compared to both (1) saturated and unsaturated fatty acids with more
than 18 carbon atoms in the fatty acid chain; as well as (2) linoleic acid and/or
alpha-linolenic acid; and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA.

8. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of:

- condensation enzymes involved in the condensation between a fatty acyl-CoA and
malonyl-CoA to form 3-ketoacyl-CoA that have specificity (as determined by a
suitable assay, such as one of the assays mentioned hereinabove) for palmitic acid,
compared to both (1) stearic acid; as well as (2) linoleic acid and alpha-linolenic acid;
and

- reductases involved in either the reduction of the 3-ketoacyl-CoA using NADPH to
form 3-hydroxyacyl-CoA; and/or in the reduction of trans-2,3-enoyl-CoA to saturated
acyl-CoA.

9. Method according to any of the preceding claims, in which the condensation
enzymes involved in the condensation between a fatty acyl-CoA and malonyl-CoA to
form 3-ketoacyl-CoA contain at least a histidine-rich motif (HXXHH), and preferably
also contain an ELO-domain.

10. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is derived from a mammal, and in
particular from a human, or is an analog of such an amino acid sequence derived from a
mammal, and in particular a human.

11. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of:
Cig30, Ssc1, Ssc2, LCE, ELOVL1, ELOVL2, ELOVL3, ELOVL4, ELOVL5 (HELO1),
ELOVL6, KAR and TER, and natural or synthetic analogs thereof, and in particular from
the group consisting of: LCE, ELOVL6, KAR and TER, and natural or synthetic analogs
thereof.

12. Method according to any of the preceding claims, in which the amino acid
sequence involved in elongation of fatty acids is chosen from the group consisting of
SEQ ID NOS: 3, 5, 8, 11, 14, 17, 20, 23, 26 and/or 33, and natural or synthetic analogs
thereof, and in particular from the group consisting of: SEQ ID NO: 3, 23 and/or 26, and
natural or synthetic analogs thereof.

13. Compound that can be used in the treatment of metabolic diseases, identified 
by a method of any of claims 1-12. 

14. Compound according to claim 13, in which said compound is a modulator of 
an amino acid sequence that is involved in the elongation of fatty acids 

15. Modulator of an amino acid sequence that is involved in the elongation of 
fatty acids, identified by a method of any of claims 1-12. 

16. Pharmaceutical composition, comprising at least one compound or modulator 
according to any of claims 11-13 and at least one pharmaceutically acceptable carrier. 

17. Pharmaceutical composition according to claim 13, being a composition 
suitable and/or intended for oral administration. 

18. Use of a compound or modulator according to any of claims 11-13 in the
preparation of a pharmaceutical composition.

19. Use of a compound or modulator according to any of claims 11-13 in the
preparation of a pharmaceutical composition for the prevention and/or treatment of
metabolic diseases. 
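
The histidine-rich motif (HXXHH) recited in claim 9 can be checked against a candidate sequence with a simple pattern search. The following is a minimal sketch only, not a method taken from the specification: it assumes one-letter amino acid codes and reads X as "any residue". The function name `find_hxxhh` and the variable names are hypothetical. The test fragment is residues 129-160 of SEQ ID NO: 3, hand-converted here from the three-letter listing.

```python
import re


def find_hxxhh(protein):
    """Return the 1-based start position of every HXXHH match.

    A lookahead is used so that overlapping matches are also reported.
    X is interpreted as any single residue (an assumption of this sketch).
    """
    return [m.start() + 1 for m in re.finditer(r"(?=H..HH)", protein.upper())]


# Residues 129-160 of SEQ ID NO: 3, converted to one-letter code (illustrative input).
fragment = "FIILRKQKLIFLHWYHHITVLLYSWYSYKDMV"
offset = 128  # number of residues that precede the fragment in SEQ ID NO: 3

# Report positions in the numbering of the full-length sequence.
positions = [offset + p for p in find_hxxhh(fragment)]
print(positions)  # [141]
```

In this fragment the motif is His-Trp-Tyr-His-His, starting at residue 141 of SEQ ID NO: 3. Checking for an ELO-domain would need a profile-based search and is outside the scope of this sketch.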

Figure 1

SEQUENCE LISTING

<110> Devgen N.V.

<120> Use of amino acid sequences involved in the elongation of fatty
acids in identifying and/or developing compounds for preventing and/or
treating of metabolic diseases.

<130> P03-015

<160> 33

<170> PatentIn version 3.1

<210> 1 

<211> 3045 

<212> DNA 

<213> homo sapiens 



<400> 1 

actaagaccg caaggcattc atttcctcct acggtggatg cggacgccgg gaggaggaga 
60 

gccccagaga gaggagctgg gagcggaggc gcaggcaatg ctcagccctg gatgtagctg 
120 



agaggctggg agaagagacg accgctggag accgagcggc gtggggaaga cctagggggg 
180

tgggtggggg aagcagacag gagaacactc gaaatcaagc gctttacaga ttattttatt 
240 

ttgtatagag aacacgtagc gactccgaag atcagcccca atgaacatgt cagtgttgac
300 



tttacaagaa tatgaattcg aaaagcagtt caacgagaat gaagccatcc aatggatgca 
360 



ggaaaactgg aagaaatctt tcctgttttc tgctctgtat gctgccttta tattcggtgg
420

tcggcaccta atgaataaac gagcaaagtt tgaactgagg aagccattag tgctctggtc
480

tctgaccctt gcagtcttca gtatattcgg tgctcttcga actggtgctt atatggtgta
540

cattttgatg accaaaggcc tgaagcagtc agtttgtgac cagggttttt acaatggacc
600

tgtcagcaaa ttctgggctt atgcatttgt gctaagcaaa gcacccgaac taggagatac
660

aatattcatt attctgagga agcagaagct gatcttcctg cactggtatc accacatcac
720

tgtgctcctg tactcttggt actcctacaa agacatggtt gccgggggag gttggttcat
780

gactatgaac tatggcgtgc acgccgtgat gtactcttac tatgccttgc gggcggcagg
840

tttccgagtc tcccggaagt ttgccatgtt catcaccttg tcccagatca ctcagatgct
900

gatgggctgt gtggttaact acctggtctt ctgctggatg cagcatgacc agtgtcactc
960

tcactttcag aacatcttct ggtcctcact catgtacctc agctaccttg tgctcttctg
1020

ccatttcttc tttgaggcct acatcggcaa aatgaggaaa acaacgaaag ctgaatagtg
1080

ttggaactga ggaggaagcc atagctcagg gtcatcaaga aaaataatag acaaaagaaa
1140

atggcacaag gaatcacacg tggtgcagct aaaacaaaac aaaacatgag caaacacaaa
1200

acccaaggca gcttagggat aattaggttg atttaaccca gtaagtttat gatcctttta
1260

gggtgaggac tcactgagtg cacctccatc tccaagcact gctgctggaa gaccccattc
1320

cctctttatc tatcaactct aggacaaggg agaacaaaag caagccagaa gcagaggaga
1380

ctaatcaaag gcaaacaaag gctattaaca cataggaaaa tatgtattta ctaagtgtca
1440

catttctcta agatgaaaga tttttactct agaaactgtg cgagcacaac acacacaatc
1500

ctttctaact ttatggacac taaactggag ccaatagaaa agacaaaaat gaaagagaca
1560

cagggtgtat atctagaacg ataatgcttt tgcagaaact aaagcctttt taagaaatgc
1620

cagctgctgt agaccccatg agaaaagatg tcttaatcat ccttatgaaa acagatgtaa
1680

acaactatat ttcaactaac ttcatcttca ctgcatagcc tcaggctagt gagtttgcca
1740

aaaccaaagg gggtgaatac ttccccaaga ttcttcctgg gaggatggaa acagtgcagc
1800

ccaggtccca tgggggcagc tccatcccag agcatttctg atagttgaac tgtaatttct
1860

actcttaagt gagatatgaa gtattatcct tttgttcagt tgccccgggc ttttgaacag
1920

aagagtaaat acagaattga aaaagataaa cactcaacca aacaatgtga aaacgggttc
1980

tgtagtattt gtaaaaaggc ccggcccagg accactgtga gctggaaaag ggagaaaggc
2040

agtgggaaaa gaggtgagcc gaagatcaat tcgacagaca gacggtgtgt atgcccctcc
2100

ctgtttgact tcacacacac tcataacttt ccaaatgaaa ccccacagta tagcgcatat
2160

tttcgatatt tttgtgaatt ccaaaaggaa atcacagggc tgttcgaaat attgggggaa
2220

cactgtgttt ctgcatcatc tgcatttgct ccccaagcaa tgtagaggtg tttaaagggc
2280

cctctgctgg ctgagtggca atactacaac aaacttcaag gcaagtttgg ctgaaaacag
2340

ttgacaacaa agggccccca tacacttatc cctcaaattt taagtgatat gaaatacttg
2400

tcatgtcttt ggccaaatca gaagatattc atcctgcttc aagtcagctt cagaaatgtt
2460

ttaaaaggga ctttagctct ggaactcaaa atcaatttat taagagccat attctttaaa
2520

aaaaaaagct ggataatatt atctgtaata tttcagtcct ttacaagcca aatacatgtg
2580

tcaatgtttc tagtatttca aagaagcaat tatgtaaagt tgttcaatgt gacataatag
2640

tattataatt ggttaagtag cttaatgatt aggcaaacta gatgaaaaga ttaggggctt
2700

ccacactgca tagatcacac gcacatagcc acgcatacac acacagacac acagatgtgg
2760

ggtacactga atttcaaagc ccaaatgaat agaaacacat tttctggcta gcagaaaaaa
2820

acaaaacaaa actgttgttt ctctttcttg ctttgagagt gtacagtaaa agggattttt
2880

tcgaattatt tttatattat tttagcttta attgtgctgt cgttcatgaa acagagctgc
2940

tctgcttttc tgtcagagat ggcaagggct ttttcagcat ctcgtttatg tgtggaattt
3000

aaaaagaata aagttttatt ccattctgaa aaaaaaaaaa aaaaa
3045



<210> 2 

<211> 798

<212> DNA

<213> Homo sapiens



<400> 2 

atgaacatgt cagtgttgac tttacaagaa tatgaattcg aaaagcagtt caacgagaat 
60

gaagccatcc aatggatgca ggaaaactgg aagaaatctt tcctgttttc tgctctgtat
120

gctgccttta tattcggtgg tcggcaccta atgaataaac gagcaaagtt tgaactgagg
180

aagccattag tgctctggtc tctgaccctt gcagtcttca gtatattcgg tgctcttcga
240

actggtgctt atatggtgta cattttgatg accaaaggcc tgaagcagtc agtttgtgac
300

cagggttttt acaatggacc tgtcagcaaa ttctgggctt atgcatttgt gctaagcaaa
360

gcacccgaac taggagatac aatattcatt attctgagga agcagaagct gatcttcctg
420

cactggtatc accacatcac tgtgctcctg tactcttggt actcctacaa agacatggtt
480

gccgggggag gttggttcat gactatgaac tatggcgtgc acgccgtgat gtactcttac
540

tatgccttgc gggcggcagg tttccgagtc tcccggaagt ttgccatgtt catcaccttg
600

tcccagatca ctcagatgct gatgggctgt gtggttaact acctggtctt ctgctggatg
660

cagcatgacc agtgtcactc tcactttcag aacatcttct ggtcctcact catgtacctc
720 

agctaccttg tgctcttctg ccatttcttc tttgaggcct acatcggcaa aatgaggaaa 
780 

acaacgaaag ctgaatag 
798 
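
The listing gives SEQ ID NO: 2 as 798 nucleotides and SEQ ID NO: 3 as 265 residues, which fits a reading frame of 265 codons plus one stop codon. The following is a minimal sketch, not part of the filed listing, that translates rows of SEQ ID NO: 2 with the standard genetic code so they can be compared against the three-letter listing of SEQ ID NO: 3. The names `CODON_TABLE` and `translate` are hypothetical, and the two rows are copied from the entry above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA in frame 1 to one-letter residues, stopping at a stop codon.

    Whitespace between the 10-nucleotide groups of the listing is ignored.
    """
    seq = "".join(dna.lower().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Rows of SEQ ID NO: 2 as printed (positions 1-60 and 781-798).
first_row = "atgaacatgt cagtgttgac tttacaagaa tatgaattcg aaaagcagtt caacgagaat"
last_row = "acaacgaaag ctgaatag"

print(translate(first_row))  # MNMSVLTLQEYEFEKQFNEN
print(translate(last_row))   # TTKAE
```

Because position 781 starts a codon (780 is a multiple of 3), the last row can be translated on its own. It ends in Thr-Thr-Lys-Ala-Glu followed by the stop codon.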



<210> 3

<211> 265

<212> PRT

<213> Homo sapiens

<400> 3

Met Asn Met Ser Val Leu Thr Leu Gln Glu Tyr Glu Phe Glu Lys Gln
1 5 10 15

Phe Asn Glu Asn Glu Ala Ile Gln Trp Met Gln Glu Asn Trp Lys Lys
20 25 30

Ser Phe Leu Phe Ser Ala Leu Tyr Ala Ala Phe Ile Phe Gly Gly Arg
35 40 45

His Leu Met Asn Lys Arg Ala Lys Phe Glu Leu Arg Lys Pro Leu Val
50 55 60

Leu Trp Ser Leu Thr Leu Ala Val Phe Ser Ile Phe Gly Ala Leu Arg
65 70 75 80

Thr Gly Ala Tyr Met Val Tyr Ile Leu Met Thr Lys Gly Leu Lys Gln
85 90 95

Ser Val Cys Asp Gln Gly Phe Tyr Asn Gly Pro Val Ser Lys Phe Trp
100 105 110

Ala Tyr Ala Phe Val Leu Ser Lys Ala Pro Glu Leu Gly Asp Thr Ile
115 120 125

Phe Ile Ile Leu Arg Lys Gln Lys Leu Ile Phe Leu His Trp Tyr His
130 135 140

His Ile Thr Val Leu Leu Tyr Ser Trp Tyr Ser Tyr Lys Asp Met Val
145 150 155 160

Ala Gly Gly Gly Trp Phe Met Thr Met Asn Tyr Gly Val His Ala Val
165 170 175

Met Tyr Ser Tyr Tyr Ala Leu Arg Ala Ala Gly Phe Arg Val Ser Arg
180 185 190

Lys Phe Ala Met Phe Ile Thr Leu Ser Gln Ile Thr Gln Met Leu Met
195 200 205

Gly Cys Val Val Asn Tyr Leu Val Phe Cys Trp Met Gln His Asp Gln
210 215 220

Cys His Ser His Phe Gln Asn Ile Phe Trp Ser Ser Leu Met Tyr Leu
225 230 235 240

Ser Tyr Leu Val Leu Phe Cys His Phe Phe Phe Glu Ala Tyr Ile Gly
245 250 255

Lys Met Arg Lys Thr Thr Lys Ala Glu
260 265



<210> 4 

<211> 3324

<212> DNA

<213> Homo sapiens



<400> 4 

aaatttattg agctattaga tacgatcttt tttgttctgc gcaagaaaaa tagccaagtg 
60

actttccttc atgtattcca tcataccatc atgccgtgga cctggtggtt tggagtcaaa
120

tttgctgcag gtggtttggg aacattccat gcccttctaa atacagctgt acatgtagtc
180

atgtattcct actatggact ttctgcattg gggccagcct accagaagta tttgtggtgg
240

aaaaaatatt tgacatcatt acagcttgtc cagtttgtta ttgtcgccat ccacataagc
300

cagttctttt tcatggagga ttgcaagtat cagtttccag tctttgcgtg catcattatg
360

agttacagtt tcatgtttct gctgctcttt ctccattttt ggtaccgtgc ttacaccaaa
420

ggtcagaggt tgcccaaaac tgtgaaaaat ggaacttgca aaaacaaaga taattgaagc
480

ccaacataag tctatgatcg aaactgatac attgtcttcc ttgacaatca agagatattt
540

acctatgcag tgcattttgt atatttttca aaactaagag cttgtatttt tatggtaagt
600

tattgggatg tctgatattt gagagcccga agcttccaga taacagtctt ctgataatta
660

gagcctacag cagccagaat ttgtttttgt ttttgttttt gtttttttag ctgcttttaa
720

acctaatcca aaggttttgt ttaaaacatt ttgttgcagt gaaaaaaaga tatgaagcaa
780

tacagtactc ttcaaagaag tccaataaag atgaaaaata tcattaggta ttttgagcac
840

agacggagat ccatgtgata aaatagattc cttctgctgg gtctggaaag tctggaaacc
900

acctgtaggc cctggctttg tcatttattg actgtattca gaagatacta tatttgctct
960

agagttaact cccattttag caagctagca caggtgaaaa ttgagtgagt ttttgataac
1020

ttgtcattta aaatcattaa tgataatttt caatggatct tttcagtagc catcaccagt
1080

tttgctgata agacttcctc acaacccatt tgttgtacaa actgtttcaa agtagcaatc
1140

ctttgggtta gtacacttga taccaagttt cattcagttg attattttca aaacaaggtg
1200

atttgtttta atgggttaat gaatactttg ctattactgt tttacaatta actttgtata
1260

tctctggaaa gaggtgaatt tgtcaatgaa aaaagtattg tgtagttcag tgggaaaaac
1320

ctgttgtctg ttatagtatc acatcacttt cttacattgt catggtttaa attattatct
1380

tggtgaaata tataggttag atacttaaat gttcatatta ttagcacact acaggtacta
1440

cccttaaatc ataaataata ttacccattg tacacagttt ccagatatga gttagagtgg
1500

ctgtgaagca actagaggca aattgtggca cagagaatat ccaggggaaa attgattatg
1560

taagcaaggg ctgttctact ttgagagaga gacagacaga ctacataata gtaatatatt
1620

atataattaa tgttatagat acactgaaca ataaaacatt ttctggaatt atgaattaat
1680

taacaacctg gatttctgtt tccagaccta taaaccatgt aatgaaggac tgaaattcgt
1740

ttggcataca ctttgtttct ttaaaattgc tagtttcttt ctgttatttt tacattttct
1800

tgtcagagaa tcaaaactat tagtcagtag agttttgtcc atgaaataat atttggacat
1860

ttgtgaattt ttccctattt tttttcttct tttatttata ctcaattttg aagcactgtt
1920

tatgtttgta ggactttaaa caattagtac ttaaagcccc agttaatttt gaacacacag
1980

agatacttat tgtactctgt gtacagtaag tatttttatg tttacactta ccagattcat
2040

aaggttattt gcctttaagt gatcttttgt gattttactt gattacagca tgagaaggta
2100

aaggttgata aatgggagta attacatata tatatataat ttattatttt tttttctgaa
2160

gcagagtttt actcttgttg cccaggctag agtgcaatgg cacaatctcg gttcactgca
2220

acctctgcct cccaggttca agtgattctc ctgtctcagc ctcccaagta gctgggatta
2280

caggtgcccg ccaccatgct cagctaattt ttgtattttt agtagagaca gggttggcca
2340

ggctggtctc gagctcctga cctcaggtga tcctcccacc tcggcctccc aaagttctgg
2400

gattacaggg gtgagccact gtgcccggcc ataattttat atttttagaa aataaagaat
2460

gaaggacaaa agagcacagc taaatgaaac tgctgctcta atttattcca ctggaaaggt
2520

ctcaggactc cttaactgtt ttccaggttt ggctcttcat atctaacctg tgctaaaatg
2580

agaaggtatg ctaggtgctg ttagaaatca caccatttcc aaagaaccca agtagtatag
2640

agaccaaaat gaggcaaaaa taaaagagat gaaacaggag agtttatttc ttgcatactt
2700

tccgaccttc atcatacaac ctctcctaac ctccctgtag tctttaaaat gtttaacttg
2760

cctataagct aacatgtaat aaaacacata ctcaattata tgaatagaag tggagagcca
2820

gaatgctaca aaagaaatca cagctgctag aagtatcctc catagaaaca tcaacatgat
2880

tgtggatcaa aatgattttc actggatatg gaatttgtat gggccatatt tattaaaaga
2940

gttctgtgtg gtcacataga gttcctttgg gatttcatcc cacttttcag gacttaattt
3000

gtttgggttt gcttacctaa caagcaacaa caacaacaaa ataaatagtt ccaaaatcta
3060

gtttattaca taaatctcta tgaatccttg agggcctata tctatattat aaataaatac
3120

aaatatagat ttttaaatat ctatgggact ttgccattta cagccttaag tataaaatta
3180

cgagattata ttctttccat taccctttat ttctgctaac tttttaaaga ctggaacatt
3240

ctgtggatgt tgtcaaagtt tgagtttgtt tttccctgtg tattataata aatttgtggt
3300

attgcaaaaa aaaaaaaaaa aaaa
3324



<210> 5

<211> 158

<212> PRT

<213> Homo sapiens

<400> 5

Lys Phe Ile Glu Leu Leu Asp Thr Ile Phe Phe Val Leu Arg Lys Lys
1 5 10 15

Asn Ser Gln Val Thr Phe Leu His Val Phe His His Thr Ile Met Pro
20 25 30

Trp Thr Trp Trp Phe Gly Val Lys Phe Ala Ala Gly Gly Leu Gly Thr
35 40 45

Phe His Ala Leu Leu Asn Thr Ala Val His Val Val Met Tyr Ser Tyr
50 55 60

Tyr Gly Leu Ser Ala Leu Gly Pro Ala Tyr Gln Lys Tyr Leu Trp Trp
65 70 75 80

Lys Lys Tyr Leu Thr Ser Leu Gln Leu Val Gln Phe Val Ile Val Ala
85 90 95

Ile His Ile Ser Gln Phe Phe Phe Met Glu Asp Cys Lys Tyr Gln Phe
100 105 110

Pro Val Phe Ala Cys Ile Ile Met Ser Tyr Ser Phe Met Phe Leu Leu
115 120 125

Leu Phe Leu His Phe Trp Tyr Arg Ala Tyr Thr Lys Gly Gln Arg Leu
130 135 140

Pro Lys Thr Val Lys Asn Gly Thr Cys Lys Asn Lys Asp Asn
145 150 155



<210> 6 

<211> 1347

<212> DNA

<213> Homo sapiens

<400> 6

cgccgccgcg cacatgccta ggttcgacgc cctcctccct ttgcccagga gttccttctg
60

tcccggctct gttccgtctc gccccgaggt tcacgccatc ctcggagccc cagcctttca
120

cccagcgcct ccaagctttg gaccttgact tctgcaaaac tagatggtca cagccatgaa
180

tgtctcacat gaagtaaatc agctgttcca gccctataac ttcgagctgt ccaaggacat
240

gaggcccttt ttcgaggagt attgggcaac ctcattcccc atagccctga tctacctggt
300

tctcatcgct gtggggcaga actacatgaa ggaacgcaag ggcttcaacc tgcaagggcc
360

tctcatcctc tggtccttct gccttgcaat cttcagtatc ctgggggcag tgaggatgtg
420

gggcattatg gggactgtgc tacttaccgg gggcctaaag caaaccgtgt gcttcatcaa
480

cttcatcgat aattccacag tcaaattctg gtcctgggtc tttcttctca gcaaggtcat
540

agaactcgga gacacagcct tcatcatcct gcgtaagcgg ccactcatct ttattcactg
600

gtaccaccac agcacagtgc tcgtgtacac aagctttgga tacaagaaca aagtgcctgc
660

aggaggctgg ttcgtcacca tgaactttgg tgttcatgcc atcatgtaca cctactacac
720

tctgaaggct gccaacgtga agccccccaa gatgctgccc atgctcatca ccagcctgca
780

gatcttgcag atgtttgtag gagccatcgt cagcatcctc acgtacatct ggaggcagga
840

tcagggatgc cacaccacga tggaacactt attctggtcc ttcatcttgt atatgaccta
900

tttcatcctc tttgcccact tcttctgcca gacctacatc aggcccaagg tcaaagccaa
960

gaccaagagc cagtgaaggt ttggagagaa caatgaagct ccaggctctc tcttctccag
1020

ggcaccaaga ggctgggctt agttttggga gaatgattag gttgccttac ctgcatggtt
1080

tccccagagg atgtgtgccc caaggtggct ggaatttttg acagacaaga agggtgacct
1140

tgggatgggg gtgtggtctg ttactttaat gtttctgttt ttaatgtgaa ggccaagcag
1200

gccctgggat gggagtgggg cggaggaggg tcctaagagc tgattattta atttctatcc
1260

agaaatcttt cttcttcttg ctctgttttt ttaaattaaa gatttcaaca aaaaaaaaaa
1320

aaaaaaaaaa aaaaaaaaaa aaaaaaa
1347



<210> 7 

<211> 813 

<212> DNA 

<213> Homo sapiens 



<400> 7 

atggtcacag ccatgaatgt ctcacatgaa gtaaatcagc tgttccagcc ctataacttc 
60 

gagctgtcca aggacatgag gccctttttc gaggagtatt gggcaacctc attccccata 
120 



gccctgatct acctggttct catcgctgtg gggcagaact acatgaagga acgcaagggc 
180

ttcaacctgc aagggcctct catcctctgg tccttctgcc ttgcaatctt cagtatcctg
240

ggggcagtga ggatgtgggg cattatgggg actgtgctac ttaccggggg cctaaagcaa
300

accgtgtgct tcatcaactt catcgataat tccacagtca aattctggtc ctgggtcttt
360

cttctcagca aggtcataga actcggagac acagccttca tcatcctgcg taagcggcca
420

ctcatcttta ttcactggta ccaccacagc acagtgctcg tgtacacaag ctttggatac
480

aagaacaaag tgcctgcagg aggctggttc gtcaccatga actttggtgt tcatgccatc
540

atgtacacct actacactct gaaggctgcc aacgtgaagc cccccaagat gctgcccatg
600

ctcatcacca gcctgcagat cttgcagatg tttgtaggag ccatcgtcag catcctcacg
660

tacatctgga ggcaggatca gggatgccac accacgatgg aacacttatt ctggtccttc
720

atcttgtata tgacctattt catcctcttt gcccacttct tctgccagac ctacatcagg
780

cccaaggtca aagccaagac caagagccag tga
813 



<210> 8 

<211> 270

<212> PRT

<213> Homo sapiens

<400> 8

Met Val Thr Ala Met Asn Val Ser His Glu Val Asn Gln Leu Phe Gln
1 5 10 15

Pro Tyr Asn Phe Glu Leu Ser Lys Asp Met Arg Pro Phe Phe Glu Glu
20 25 30

Tyr Trp Ala Thr Ser Phe Pro Ile Ala Leu Ile Tyr Leu Val Leu Ile
35 40 45

Ala Val Gly Gln Asn Tyr Met Lys Glu Arg Lys Gly Phe Asn Leu Gln
50 55 60

Gly Pro Leu Ile Leu Trp Ser Phe Cys Leu Ala Ile Phe Ser Ile Leu
65 70 75 80

Gly Ala Val Arg Met Trp Gly Ile Met Gly Thr Val Leu Leu Thr Gly
85 90 95

Gly Leu Lys Gln Thr Val Cys Phe Ile Asn Phe Ile Asp Asn Ser Thr
100 105 110

Val Lys Phe Trp Ser Trp Val Phe Leu Leu Ser Lys Val Ile Glu Leu
115 120 125

Gly Asp Thr Ala Phe Ile Ile Leu Arg Lys Arg Pro Leu Ile Phe Ile
130 135 140

His Trp Tyr His His Ser Thr Val Leu Val Tyr Thr Ser Phe Gly Tyr
145 150 155 160

Lys Asn Lys Val Pro Ala Gly Gly Trp Phe Val Thr Met Asn Phe Gly
165 170 175

Val His Ala Ile Met Tyr Thr Tyr Tyr Thr Leu Lys Ala Ala Asn Val
180 185 190

Lys Pro Pro Lys Met Leu Pro Met Leu Ile Thr Ser Leu Gln Ile Leu
195 200 205

Gln Met Phe Val Gly Ala Ile Val Ser Ile Leu Thr Tyr Ile Trp Arg
210 215 220

Gln Asp Gln Gly Cys His Thr Thr Met Glu His Leu Phe Trp Ser Phe
225 230 235 240

Ile Leu Tyr Met Thr Tyr Phe Ile Leu Phe Ala His Phe Phe Cys Gln
245 250 255

Thr Tyr Ile Arg Pro Lys Val Lys Ala Lys Thr Lys Ser Gln
260 265 270

<210> 9

<211> 3011

<212> DNA

<213> Homo sapiens


<400> 9 

ggcacgagcc aaccccggcc taggctctcc accgcatcgg attctggaat ttacgatcac 
60 

gaaagttcta ttgtcccgcg attggctccc gggccgcatg acatcatagc gcttgattca
120

tccttcgggt cccgattggc tggccgcgcc attgtgacgt cacggtcagc ccacgttctg
180

attgtagata gccggcgcct tcctcttccc atcgcgcggg tcctagccac cggtgtctcc
240

ttctacatcc gcctctgcgc cggctgccac ccgcgctccc tccgccgccg ccgccttgct
300

gctgctcaaa gctgctgccg ccccttgggc taaaaggttt tcaaatggaa cattttgatg
360

catcacttag tacctatttc aaggcattgc taggccctcg agatactaga gtaaaaggat
420

ggtttcttct ggacaattat atacccacat ttatctgctc tgtcatatat ttactaattg
480

tatggctggg accaaaatac atgaggaata aacagccatt ctcttgccgg gggattttag
540

tggtgtataa ccttggactc acactgctgt ctctgtatat gttctgtgag ttagtaacag
600

gagtatggga aggcaaatac aacttcttct gtcagggcac acgcaccgca ggagaatcag
660

atatgaagat tatccgtgtc ctctggtggt actacttctc caaactcata gaatttatgg
720

acactttctt cttcatcctg cgcaagaaca accaccagat cacggtcctg cacgtctacc
780

accatgcctc gatgctgaac atctggtggt ttgtgatgaa ctgggtcccc tgcggccact
840

cttattttgg tgccacactt aatagcttca tccacgtcct catgtactct tactatggtt
900

tgtcgtcagt cccttccatg cgtccatacc tctggtggaa gaagtacatc actcaggggc
960

agctgcttca gtttgtgctg acaatcatcc agaccagctg cggggtcatc tggccgtgca
1020

cattccctct tggttggttg tatttccaga ttggatacat gatttccctg attgctctct
1080

tcacaaactt ctacattcag acctacaaca agaaaggggc ctcccgaagg aaagaccacc
1140

tgaaggacca ccagaatggg tccatggctg ctgtgaatgg acacaccaac agcttttcac
1200

ccctggaaaa caatgtgaag ccaaggaagc tgcggaagga ttgaagtcaa agaattgaaa
1260

ccctccaaac cacgtcatct gattgtaagc acaatatgag ttgtgcccca atgctcgtta
1320

acagctgctg taactagtct ggcctacaat agtgtgattc atgtaggact tctttcatca
1380

attcaaaacc cctagaaaac gtatacagat tatataagta gggataagat ttctaacatt
1440

tctgggctct ctgacccctg cgctagactg tggaaaggga gtattattat agtatacaac
1500

actgctgttg ccttattagt tataacatga taggtgctga attgtgattc acaatttaaa
1560

aacactgtaa tccaaacttt tttttttaac tgtagatcat gcatgtgatt gtaaatgtaa
1620

atttgtacaa tgttgttatg gtagagaaac acacatgcct taaaatttaa aaagcagggc
1680

ccaaagctta ttagtttaaa ttagggtatg tttcaagttt gtattaattt gtaatagctc
1740

tgtttagaaa aaatcaaaga ccatgattta tgaaactaat gtgacataat ttccagtgac
1800

ttgttgatgt gaaatcagac acggcacctt cagttttgta ctattggctt tgaatcaagc
1860 

aggctcaaat ctagtggaac agtcagttta actttttaac agatcttatt tttttatttt 
1920 



gagtgccact attaatgtaa aaaggggggg gctctacagc agtcgtgatg aaacttaaat 
1980 



atatattctt tgtcctcgag attttaggaa gggtgtaggg tgagtaggcc atttttaatt 
2040

tctgaagtgc taagtgtttt tatacagcaa acaaaaagtc aattttgctt tccaccagtg 
2100 

cgagagagga tgtatacttt tcaagagaga tgattgccta tttaccgttt gacagagtcc
2160 

cgtagatgag caatggggaa ctggttgcca gggtctaaat ttggattgat ttatgcactg 
2220 



ttatctgttt tgacacagat ttccttgtaa aatgtgccta gtttaccaaa attaacaaag 
2280 



ggggggaaag gaccttagaa ctttttaagg taaaatcaaa tatagctaca gcataagaga 
2340

atcgagaaat ttgatagagg taacttgttt aatgtaaatc taatagtact tgtaatttct 
2400 

ttctgcttag aatctaaaga tgtgtttaga acctcttgtt taaaaataat agactgctta
2460

tcataaaatc acatctcaca catttgaggc agtggtcaaa caggtaaagc ctatgatgtg
2520 

tgtcatttta aagtgtcgga atttagcctc tgaatacctt ctccattggg ggaaagatat 
2580 

tcttggaacc actcatgaca tatcttagaa ggtcattgac aatgtataaa ctaattgttg 
2640 



gtttgatatt tatgtaaata tcagtttacc atgctttaat tttgcacatt cgtactatag 
2700 



ggagcctatt ggttctctat tagtcttgtg ggttttctgt ttgaaaagga gtcatggcat 
2760 



ctgtttacat ttaccttatc aaacctagaa tgtgtatatt tataaatgta tgtcttcatt 
2820

gctaggtact aatttgcaga tgtctttaca tatttcaata cagaaactat aacattcaat 
2880 

agtgtgctgt caaagtgtgc ttagctcacc tggatatacc tacattgtta aatgtctaaa 
2940 



cagtaatcat taaaacattt ttgattacct gtgaaaaaaa aaaaaaaaaa aaaaaaaaaa 
3000 

aaaaaaaaaa a 
3011 



<210> 10 

<211> 900 

<212> DNA 

<213> Homo sapiens 



<400> 10 

atggaacatt ttgatgcatc acttagtacc tatttcaagg cattgctagg ccctcgagat
60

actagagtaa aaggatggtt tcttctggac aattatatac ccacatttat ctgctctgtc
120

atatatttac taattgtatg gctgggacca aaatacatga ggaataaaca gccattctct
180

tgccggggga ttttagtggt gtataacctt ggactcacac tgctgtctct gtatatgttc
240

tgtgagttag taacaggagt atgggaaggc aaatacaact tcttctgtca gggcacacgc
300

accgcaggag aatcagatat gaagattatc cgtgtcctct ggtggtacta cttctccaaa
360

ctcatagaat ttatggacac tttcttcttc atcctgcgca agaacaacca ccagatcacg
420

gtcctgcacg tctaccacca tgcctcgatg ctgaacatct ggtggtttgt gatgaactgg
480

gtcccctgcg gccactctta ttttggtgcc acacttaata gcttcatcca cgtcctcatg
540




tactcttact atggtttgtc gtcagtccct tccatgcgtc catacctctg gtggaagaag 
600 

tacatcactc aggggcagct gcttcagttt gtgctgacaa tcatccagac cagctgcggg 
660 



gtcatctggc cgtgcacatt ccctcttggt tggttgtatt tccagattgg atacatgatt 
720

tccctgattg ctctcttcac aaacttctac attcagacct acaacaagaa aggggcctcc 
780 

cgaaggaaag accacctgaa ggaccaccag aatgggtcca tggctgctgt gaatggacac
840 

accaacagct tttcacccct ggaaaacaat gtgaagccaa ggaagctgcg gaaggattga 
900 

<210> 11

<211> 299

<212> PRT

<213> Homo sapiens


<400> 11 

Met Glu His Phe Asp Ala Ser Leu Ser Thr Tyr Phe Lys Ala Leu Leu 
1 5 10 15

Gly Pro Arg Asp Thr Arg Val Lys Gly Trp Phe Leu Leu Asp Asn Tyr
20 25 30

Ile Pro Thr Phe Ile Cys Ser Val Ile Tyr Leu Leu Ile Val Trp Leu
35 40 45

Gly Pro Lys Tyr Met Arg Asn Lys Gln Pro Phe Ser Cys Arg Gly Ile
50 55 60

Leu Val Val Tyr Asn Leu Gly Leu Thr Leu Leu Ser Leu Tyr Met Phe
65 70 75 80

Cys Glu Leu Val Thr Gly Val Trp Glu Gly Lys Tyr Asn Phe Phe Cys
85 90 95

Gln Gly Thr Arg Thr Ala Gly Glu Ser Asp Met Lys Ile Ile Arg Val
100 105 110

Leu Trp Trp Tyr Tyr Phe Ser Lys Leu Ile Glu Phe Met Asp Thr Phe
115 120 125

Phe Phe Ile Leu Arg Lys Asn Asn His Gln Ile Thr Val Leu His Val
130 135 140

Tyr His His Ala Ser Met Leu Asn Ile Trp Trp Phe Val Met Asn Trp
145 150 155 160

Val Pro Cys Gly His Ser Tyr Phe Gly Ala Thr Leu Asn Ser Phe Ile
165 170 175

His Val Leu Met Tyr Ser Tyr Tyr Gly Leu Ser Ser Val Pro Ser Met
180 185 190

Arg Pro Tyr Leu Trp Trp Lys Lys Tyr Ile Thr Gln Gly Gln Leu Leu
195 200 205

Gln Phe Val Leu Thr Ile Ile Gln Thr Ser Cys Gly Val Ile Trp Pro
210 215 220

Cys Thr Phe Pro Leu Gly Trp Leu Tyr Phe Gln Ile Gly Tyr Met Ile
225 230 235 240

Ser Leu Ile Ala Leu Phe Thr Asn Phe Tyr Ile Gln Thr Tyr Asn Lys
245 250 255

Lys Gly Ala Ser Arg Arg Lys Asp His Leu Lys Asp His Gln Asn Gly
260 265 270

Ser Met Ala Ala Val Asn Gly His Thr Asn Ser Phe Ser Pro Leu Glu
275 280 285

Asn Asn Val Lys Pro Arg Lys Leu Arg Lys Asp
290 295

<210> 12

<211> 2992

<212> DNA

<213> Homo sapiens


<400> 12 

ctcgtctttc tcccgggaac cttgacgacg ccttccgctt ggccctgcct tctgccgcat 
60 

20 ccccgccgcc gcggcgcctt gaggagcagg agaagacgca gccgggccgc cgccgttaga 
120 

ggggttcccg gccgccgctc gccccgtcgg ccgccaccgc ctccggggtc agccctctct 
180 

25 

ctgggtctcc gctttctcct gccgccagcg cccgctcatc gccgcgatgg ggctcctgga 
240 

ctcggagccg ggtagtgtcc taaacgtagt gtccacggca ctcaacgaca cggtagagtt 
30 300 

ctaccgctgg acctggtcca tcgcagataa gcgtgtggaa aattggcctc tgatgcagtc 
360 

35 tccttggcct acactaagta taagcactct ttatctcctg tttgtgtggc tgggtccaaa 
420 

atggatgaag gaccgagaac cttttcagat gcgtctagtg ctcattatct ataattttgg 
480 

gatggttttg cttaacctct ttatcttcag agagttattc atgggatcat ataatgcggg 
540 



40 



atatagctat atttgccaga gtgtggatta ttctaataat gttcatgaag tcaggatagc 
45 600 

tgctgctctg tggtggtact ttgtatctaa aggagttgag tatttggaca cagtgttttt 

660 

50 tattctgaga aagaaaaaca accaagtttc tttccttcat gtgtatcatc actgtacgat 
720 

gtttaccttg tggtggattg gaattaagtg ggttgcagga ggacaagcat tttttggagc 
780 

55 

ccagttgaat tcctttatcc atgtgattat gtactcatac tatgggttaa ctgcatttgg 
840 
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I 



cccatggatt cagaaatatc tttggtggaa 
900 

5 attccatgtg accattgggc acacggcact 
960 

atggatgcac tgggctctaa ttgcctatgc 
1020 

10 

ctacattcgg acatacaaag agcctaagaa 
1080 

tatttcagca aatggtgtga gcaaatcaga 
15 1140 

gcagaaaaat ggaaaagcaa aaggagatta 
1200 

20 gtgaggaaaa actcccatat catataaaat 
1260 

9g99gtgggg agaaaagaca aatgtgctct 
1320 

25 

tgttaatacc atacccagat gttttattta 
1380 

aaattagcct tgatattctc cagaccaaag 
30 1440 

cctgttcaca tccagttgtc taaaggatga 
1500 

35 cttcggtctg aattttagac gatcacagaa 
1560 

ctaattatct tatctactga ctgaaatcag 
1620 

40 

aatgtgtata cacttaaact tgcacatttg 
1680 

tctccacctc tagtcagagt ataattttgg 
45 1740 

tctgtagaat cttagaggct tgatgatgat 
1800 

50 taaagtcttg tctggtgacc cagagatcac 
1860 

aacaattttt tgtgatgaaa aggcagcatt 
1920 

55 

tatgaaaatt aagattattg tctgattgaa 
1980 



acgatacctg actatgttgc aactgattca 
gtctctttac actgactgcc ccttccccaa 
aatcagcttc atatttctct ttcttaactt 
accaaaagct ggaaaaacag ccatgaatgg 
aaaacaactc atgatagaaa atggaaaaaa 
aattgaactg ggccttaact gttgttgaca 
ttcagggaaa acagaagcaa aggagagctt 
atgtcctagt aactcttaga ctgagtaaag 
tgaagttttt attttaaaca ttttttttaa 
caatcattaa gtgactttgg ggattctccc 
gatttttcat gtatcttata gtcactcatt 
acggtcttta tgaattattt tgataaatta 
tggtgttaca ttttcttgtc caaagctgaa 
aattcatttg ctgaccggaa tggtcaaatc 
ttgtaattaa atttttaaaa tctgctgatc 
99tgttggtg aaaataagaa agaattgcag 
catgacttga ggcacaaatc actgtgggga 
tgaatactcc tgttagtagc agaaatatat 
acatgaaaca actcatgtct ttattagtaa 
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catcataaga tagttacatt tatgtgctgt tagaatatgt tgatttttat caggctttcc 
2040 

5 ttgttttgat ttatggctgt tcctgatttt tcatatgtgg aaatatacct acctcttccg 
2100 

ttggaaagaa catttaaaat taaataaatt ttaattaaaa aatcaaggag tcttctaatg 
2160 

10 

taaattttaa tgttaacttt caaatccact agtatttttt ttgcttttat gacaaatagc 
2220 

atacaccaaa catttctgtg aaactatcct tctctttcaa tgtgtttaat tttggagtaa 
15 2280 

cgttttcctt gtgactaagt tgcaagatct tatttattaa ctaggtatga agtataaacc 
2340 

20 cattttggtg caatattctt gactccttgg tgctaaagat tgttaaattc aatgcttgat 
2400 

gttacaaggt gttgttaaaa cacaaaatga ataaaagtga gagtagtcag aactataaca 
2460 

25 

ttcaatttgc tatttacaaa tgaagtattt catgtaatat aagtgaacaa ctggaaataa 
2520 

agtaggaaag aatttgtatc atgttttact acataggtta attttttaag ggatgttgca 
30 2580 

aagggattac tagagaaaga caaaatgtga ccaaaaaaaa gcatgaatat ttcttaagta 
2640 

35 tctcaacaac atgtcaaagc tgcatgtgta ggatgtatgc tgtttgtaca aactatttca 
2700 

gaatattttg taagctataa catatttatt gtgcattaaa attaaatact ttttccccaa
2760 

40 

aggcatgcag tcatgagaat tacagaaaat ttgcaacata taaagtagtt tgatctaaga 
2820 

ggattcaaca cctttgtttt gttgctcagt gtgtaatgac tgagatttgt aaatctttgt
45 2880 

gaacattctg tactggttcc caagagctat tcattccctg ctacctgatt tcagcacaat 
2940 

50 aaatatactt ctgctgtggg aaaaataaaa aaaaaaaaaa aaaaaaaaaa aa 
2992 



<210> 13

<211> 945

<212> DNA

<213> Homo sapiens

<400> 13

atggggctcc tggactcgga gccgggtagt gtcctaaacg tagtgtccac ggcactcaac 
60 

gacacggtag agttctaccg ctggacctgg tccatcgcag ataagcgtgt ggaaaattgg 
120 



cctctgatgc agtctccttg gcctacacta agtataagca ctctttatct cctgtttgtg 
15 180 

tggctgggtc caaaatggat gaaggaccga gaaccttttc agatgcgtct agtgctcatt 



240 



atctataatt ttgggatggt tttgcttaac ctctttatct tcagagagtt attcatggga 



300 



tcatataatg cgggatatag ctatatttgc cagagtgtgg attattctaa taatgttcat 



360



gaagtcagga tagctgctgc tctgtggtgg tactttgtat ctaaaggagt tgagtatttg 
420 

gacacagtgt tttttattct gagaaagaaa aacaaccaag tttctttcct tcatgtgtat 
480

catcactgta cgatgtttac cttgtggtgg attggaatta agtgggttgc aggaggacaa 
540 

35 gcattttttg gagcccagtt gaattccttt atccatgtga ttatgtactc atactatggg 
600 

ttaactgcat ttggcccatg gattcagaaa tatctttggt ggaaacgata cctgactatg 



660 



ttgcaactga ttcaattcca tgtgaccatt gggcacacgg cactgtctct ttacactgac 



720 



tgccccttcc ccaaatggat gcactgggct ctaattgcct atgcaatcag cttcatattt 
45 780 

ctctttctta acttctacat tcggacatac aaagagccta agaaaccaaa agctggaaaa 
840 

50 acagccatga atggtatttc agcaaatggt gtgagcaaat cagaaaaaca actcatgata 
900 



gaaaatggaa aaaagcagaa aaatggaaaa gcaaaaggag attaa 
945 



55 
<210> 14

<211> 314

<212> PRT

<213> Homo sapiens

<400> 14

Met Gly Leu Leu Asp Ser Glu Pro Gly Ser Val Leu Asn Val Val Ser
1 5 10 15

Thr Ala Leu Asn Asp Thr Val Glu Phe Tyr Arg Trp Thr Trp Ser Ile
20 25 30

Ala Asp Lys Arg Val Glu Asn Trp Pro Leu Met Gln Ser Pro Trp Pro
35 40 45

Thr Leu Ser Ile Ser Thr Leu Tyr Leu Leu Phe Val Trp Leu Gly Pro
50 55 60

Lys Trp Met Lys Asp Arg Glu Pro Phe Gln Met Arg Leu Val Leu Ile
65 70 75 80

Ile Tyr Asn Phe Gly Met Val Leu Leu Asn Leu Phe Ile Phe Arg Glu
85 90 95

Leu Phe Met Gly Ser Tyr Asn Ala Gly Tyr Ser Tyr Ile Cys Gln Ser
100 105 110

Val Asp Tyr Ser Asn Asn Val His Glu Val Arg Ile Ala Ala Ala Leu
115 120 125

Trp Trp Tyr Phe Val Ser Lys Gly Val Glu Tyr Leu Asp Thr Val Phe
130 135 140

Phe Ile Leu Arg Lys Lys Asn Asn Gln Val Ser Phe Leu His Val Tyr
145 150 155 160

His His Cys Thr Met Phe Thr Leu Trp Trp Ile Gly Ile Lys Trp Val
165 170 175

Ala Gly Gly Gln Ala Phe Phe Gly Ala Gln Leu Asn Ser Phe Ile His
180 185 190

Val Ile Met Tyr Ser Tyr Tyr Gly Leu Thr Ala Phe Gly Pro Trp Ile
195 200 205

Gln Lys Tyr Leu Trp Trp Lys Arg Tyr Leu Thr Met Leu Gln Leu Ile
210 215 220

Gln Phe His Val Thr Ile Gly His Thr Ala Leu Ser Leu Tyr Thr Asp
225 230 235 240

Cys Pro Phe Pro Lys Trp Met His Trp Ala Leu Ile Ala Tyr Ala Ile
245 250 255

Ser Phe Ile Phe Leu Phe Leu Asn Phe Tyr Ile Arg Thr Tyr Lys Glu
260 265 270

Pro Lys Lys Pro Lys Ala Gly Lys Thr Ala Met Asn Gly Ile Ser Ala
275 280 285

Asn Gly Val Ser Lys Ser Glu Lys Gln Leu Met Ile Glu Asn Gly Lys
290 295 300

Lys Gln Lys Asn Gly Lys Ala Lys Gly Asp
305 310

<210> 15 
<211> 2340 
<212> DNA

<213> Homo sapiens

<400> 15 

gatagcgccg ggcagaggga cccggctacc ctggacagcg catcgccgcc cgcccgggtc 
60 

50 gccgcgccac agccgctgcg gatcatggaa catctaaagg cctttgatga tgaaatcaat 
120 

gcttttttgg acaatatgtt tggaccgcga gattctcgag tcagagggtg gttcacgttg 
180 

55 

gactcttacc ttcctacctt ttttcttact gtcatgtatc tgctctcaat atggctgggt 
240 
aacaagtata tgaagaacag acctgctctt tctctcaggg gtatcctcac cttgtataat
300 

5 cttggaatca cacttctctc cgcgtacatg ctggcagagc tcattctctc cacttgggaa 
360 

ggaggctaca acttacagtg tcaagatctt accagcgcag gggaagctga catccgggta 
420 

10 

gccaaggtgc tttggtggta ctatttctcc aaatcagtag agttcctgga cacaattttc 
480 

ttcgttttgc ggaaaaaaac gagtcagatt acttttcttc atgtatatca tcatgcttct 
15 540 

atgtttaaca tctggtggtg tgtcttgaac tggatacctt gtggacaaag tttctttgga 
600 

20 ccaacactga acagttttgt ccacattctt atgtactcct actatggact ttctgtgttt 
660 

ccatctatgc acaagtatct ttggtggaag aaatatctca cacaggctca gctggtgcag 
720 

25 

ttcgtgctca ccatcacgca caccatgagc gccgtcgtga aaccgtgtgg cttccccttc 
780 

ggttgtctca tcttccagtc atcttatatg ctaacgttag tcatcctctt cttaaatttt 
30 840 

tatgttcaga cataccgaaa aaagccaatg aagaaagata tgcaagagcc acctgcaggg 
900 

35 aaagaagtga agaatggttt ttccaaagcc tacttcactg cagcaaatgg agtgatgaac 
960 



40 



aagaaagcac aataaaaatg agtaacagaa aaagcacata tactagccta acagattggc 
1020 

ttgttttaaa gcaaagactg aattgaaggt tacatgtttt aggataaact aatttctttt 
1080 



gagttcataa atcatttgta cccagaatgt attaatatat tgctattagg ttaatctgtt 
45 1140 

aactgaatgc tttgatcagc attgaggtga tgctcacctc cgaggacctc agaactggtg 
1200 

50 cagcttctct ctccctccct cccacagact gaacctttcg ccagaagctg tccttataac 
1260 

gccttatacg catacacagc caggaaacgt ggagcattgt ttctcacaga gagtctccaa 
1320 

55 

ataaaaaggg ttttgttcag attaaaatgt ttacaacaaa atgttaatta tattctaaat 
1380 
acagggtatg ttctaatcta tattaagcaa taatgccagt gcataatcat tccatttgtt
1440 

5 cctttagcaa tcaaccccag aaaatattaa aatgggatca tacacagaag atagaaaaat 
1500 

ctagcaaaac ttctctttct gtaagccaga gtcttgtcta tcagattccc acaaccactc
1560



ctgattctaa atttagtgat atggtaatga aattggtatt tattttaaat attagttatt 



1620 



ctaaggagaa aaaaatgctt ctgcaagatt ttcataattc aggggctgtg gataggattg 
15 1680 

ttcctctgtt tccctaatca ttcatctgtt catgtctccc tcttgtgcca gtcagcctag 
1740 

20 gttatacaga tgccatgctc cacaccacga gcagtgtaca aatctggctg cccgtttact 
1800 



ttctgagcaa gcactggagt ccactccgac ctttttcttt gaacatgcat gctgctggaa 
1860 

tatgtataaa tcagaactag cagaagtagc agagtgatgg gagcaaaata ggcactgaat 
1920 



tcgtcaactc ttttttgtga gcctacttgt gaatattacc tcagatacct gttgtcactc 
30 1980 

ttcacaggtt atttaagttc ttgaagctgg gaggaaaaag atggagtagc ttggaaagat 
2040 

35 tccagcactg agccgtgagc cggtcatgag ccacgataaa aaatgccagt ttggcaaact 
2100 

cagcactcct gttccctgct caggtatatg cgatctctac tgagaagcaa gcacaaaagt 
2160 



agaccaaagt attaatgagt atttcctttc tccataagtg caggactgtt actcactact 
2220 



aaactctacc aagaatggaa accaagaata ttttctgaag atttttttga agattaattt 
45 2280 

ataccctata aaataaaact tgttagcttc gatgaagtca aaaaaaaaaa aaaaaaaaaa 
2340 

50 

<210> 16

<211> 891

<212> DNA

<213> Homo sapiens

<400> 16

atggaacatc taaaggcctt tgatgatgaa atcaatgctt ttttggacaa tatgtttgga 
60 

ccgcgagatt ctcgagtcag agggtggttc acgttggact cttaccttcc tacctttttt 
10 120 

cttactgtca tgtatctgct ctcaatatgg ctgggtaaca agtatatgaa gaacagacct 
180 

15 gctctttctc tcaggggtat cctcaccttg tataatcttg gaatcacact tctctccgcg 
240 

tacatgctgg cagagctcat tctctccact tgggaaggag gctacaactt acagtgtcaa 
300 

gatcttacca gcgcagggga agctgacatc cgggtagcca aggtgctttg gtggtactat 
360 



20 



ttctccaaat cagtagagtt cctggacaca attttcttcg ttttgcggaa aaaaacgagt 
25 420 

cagattactt ttcttcatgt atatcatcat gcttctatgt ttaacatctg gtggtgtgtc 
480 

30 ttgaactgga taccttgtgg acaaagtttc tttggaccaa cactgaacag ttttgtccac 
540 



35 



50 



55 



attcttatgt actcctacta tggactttct gtgtttccat ctatgcacaa gtatctttgg 
600 

tggaagaaat atctcacaca ggctcagctg gtgcagttcg tgctcaccat cacgcacacc 
660 



atgagcgccg tcgtgaaacc gtgtggcttc cccttcggtt gtctcatctt ccagtcatct 
40 720 

tatatgctaa cgttagtcat cctcttctta aatttttatg ttcagacata ccgaaaaaag 
780 

45 ccaatgaaga aagatatgca agagccacct gcagggaaag aagtgaagaa tggtttttcc 
840 



aaagcctact tcactgcagc aaatggagtg atgaacaaga aagcacaata a 
891 



<210> 17 
<211> 296 
<212> PRT 
<213> Homo sapiens

<400> 17

Met Glu His Leu Lys Ala Phe Asp Asp Glu Ile Asn Ala Phe Leu Asp
1 5 10 15

Asn Met Phe Gly Pro Arg Asp Ser Arg Val Arg Gly Trp Phe Thr Leu
20 25 30

Asp Ser Tyr Leu Pro Thr Phe Phe Leu Thr Val Met Tyr Leu Leu Ser
35 40 45

Ile Trp Leu Gly Asn Lys Tyr Met Lys Asn Arg Pro Ala Leu Ser Leu
50 55 60

Arg Gly Ile Leu Thr Leu Tyr Asn Leu Gly Ile Thr Leu Leu Ser Ala
65 70 75 80

Tyr Met Leu Ala Glu Leu Ile Leu Ser Thr Trp Glu Gly Gly Tyr Asn
85 90 95

Leu Gln Cys Gln Asp Leu Thr Ser Ala Gly Glu Ala Asp Ile Arg Val
100 105 110

Ala Lys Val Leu Trp Trp Tyr Tyr Phe Ser Lys Ser Val Glu Phe Leu
115 120 125

Asp Thr Ile Phe Phe Val Leu Arg Lys Lys Thr Ser Gln Ile Thr Phe
130 135 140

Leu His Val Tyr His His Ala Ser Met Phe Asn Ile Trp Trp Cys Val
145 150 155 160

Leu Asn Trp Ile Pro Cys Gly Gln Ser Phe Phe Gly Pro Thr Leu Asn
165 170 175

Ser Phe Val His Ile Leu Met Tyr Ser Tyr Tyr Gly Leu Ser Val Phe
180 185 190

Pro Ser Met His Lys Tyr Leu Trp Trp Lys Lys Tyr Leu Thr Gln Ala
195 200 205

Gln Leu Val Gln Phe Val Leu Thr Ile Thr His Thr Met Ser Ala Val
210 215 220

Val Lys Pro Cys Gly Phe Pro Phe Gly Cys Leu Ile Phe Gln Ser Ser
225 230 235 240

Tyr Met Leu Thr Leu Val Ile Leu Phe Leu Asn Phe Tyr Val Gln Thr
245 250 255

Tyr Arg Lys Lys Pro Met Lys Lys Asp Met Gln Glu Pro Pro Ala Gly
260 265 270

Lys Glu Val Lys Asn Gly Phe Ser Lys Ala Tyr Phe Thr Ala Ala Asn
275 280 285

Gly Val Met Asn Lys Lys Ala Gln
290 295

<210> 18

<211> 1561

<212> DNA

<213> Homo sapiens

<400> 18

gcccagcaga tgaggaagtg gcaggcaggc aggctggccc cggggacttc tctctggccc 
60 

40 tgctccctcc gagcgctccg ccgttgcccg cctggcccct acggagtcct tagccaggat 
120

ggaggctgtt gtgaacttgt accaagaggt gatgaagcac gcagatcccc ggatccaggg 
180 

45 

ctaccctctg atggggtccc ccttgctaat gacctccatt ctcctgacct acgtgtactt 
240

cgttctctca cttgggcctc gcatcatggc taatcggaag cccttccagc tccgtggctt 
50 300 

catgattgtc tacaacttct cactggtggc actctccctc tacattgtct atgagttcct 

360 

55 gatgtcgggc tggctgagca cctatacctg gcgctgtgac cctgtggact attccaacag 
420 
ccctgaggca cttaggatgg ttcgggtggc ctggctcttc ctcttctcca agttcattga
480 

gctgatggac acagtgatct ttattctccg aaagaaagac gggcaggtga ccttcctaca 
5 540 

tgtcttccat cactctgtgc ttccctggag ctggtggtgg ggggtaaaga ttgccccggg 
600 

10 aggaatgggc tctttccatg ccatgataaa ctcttccgtg catgtcataa tgtacctgta 
660 



15 



30 



45 



ctacggatta tctgcctttg gccctgtggc acaaccctac ctttggtgga aaaagcacat 
720 

gacagccatt cagctgatcc agtttgtcct ggtctcactg cacatctccc agtactactt 
780 



tatgtccagc tgtaactacc agtacccagt cattattcac ctcatctgga tgtatggcac 
20 840 

catcttcttc atgctgttct ccaacttctg gtatcactct tataccaagg gcaagcggct 
900 

25 gccccgtgca cttcagcaaa atggagctcc aggtattgcc aaggtcaagg ccaactgaga 
960 



agcatggcct agataggcgc ccacctaagt gcctcaggac tgcaccttag ggcagtgtcc 
1020 

gtcagtgccc tctccaccta cacctgtgac caaggcttat gtggtcagga ctgagcaggg 
1080 



gactggccct cccctcccca cagctgctct acagggacca cggctttggt tcctcaccca 
35 1140 

cttcccccgg gcagctccag ggatgtggcc tcattgctgt ctgccactcc agagctgggg 
1200 

40 gctaaaaggg ctgtacagtt atttccccct ccctgcctta aaacttggga gaggagcact 
1260 

cagggctggc cccacaaagg gtctcgtggc ctttttcctc acacagaaga ggtcagcaat 
1320 



aatgtcactg tggacccagt ctcactcctc caccccacac actgaagcag tagcttctgg 
1380 



gccaaaggtc agggtgggcg ggggcctggg aatacagcct gtggaggctg cttactcaac 
50 1440 

ttgtgtctta attaaaagtg acagaggaaa ccacggaggc tgtgtgtata cgtatgtgaa 
1500 

55 cagaggaggt gggggaacac aacacgctta gccctgtcca caaacaagag ggatggatgg 
1560 
t
1561

<210> 19

<211> 840

<212> DNA

<213> Homo sapiens

<400> 19

atggaggctg ttgtgaactt gtaccaagag gtgatgaagc acgcagatcc ccggatccag 
60 

ggctaccctc tgatggggtc ccccttgcta atgacctcca ttctcctgac ctacgtgtac 
20 120 

ttcgttctct cacttgggcc tcgcatcatg gctaatcgga agcccttcca gctccgtggc 
180 

25 ttcatgattg tctacaactt ctcactggtg gcactctccc tctacattgt ctatgagttc 
240 



30 



45 



ctgatgtcgg gctggctgag cacctatacc tggcgctgtg accctgtgga ctattccaac 
300 

agccctgagg cacttaggat ggttcgggtg gcctggctct tcctcttctc caagttcatt 
360 



gagctgatgg acacagtgat ctttattctc cgaaagaaag acgggcaggt gaccttccta 
35 420 

catgtcttcc atcactctgt gcttccctgg agctggtggt ggggggtaaa gattgccccg
480 

40 ggaggaatgg gctctttcca tgccatgata aactcttccg tgcatgtcat aatgtacctg 
540 

tactacggat tatctgcctt tggccctgtg gcacaaccct acctttggtg gaaaaagcac 
600 



atgacagcca ttcagctgat ccagtttgtc ctggtctcac tgcacatctc ccagtactac 
660 



tttatgtcca gctgtaacta ccagtaccca gtcattattc acctcatctg gatgtatggc 
720

accatcttct tcatgctgtt ctccaacttc tggtatcact cttataccaa gggcaagcgg 
780 

55 ctgccccgtg cacttcagca aaatggagct ccaggtattg ccaaggtcaa ggccaactga 
840 
<210> 20

<211> 279

<212> PRT

<213> Homo sapiens

<400> 20

Met Glu Ala Val Val Asn Leu Tyr Gln Glu Val Met Lys His Ala Asp
1 5 10 15

Pro Arg Ile Gln Gly Tyr Pro Leu Met Gly Ser Pro Leu Leu Met Thr
20 25 30

Ser Ile Leu Leu Thr Tyr Val Tyr Phe Val Leu Ser Leu Gly Pro Arg
35 40 45

Ile Met Ala Asn Arg Lys Pro Phe Gln Leu Arg Gly Phe Met Ile Val
50 55 60

Tyr Asn Phe Ser Leu Val Ala Leu Ser Leu Tyr Ile Val Tyr Glu Phe
65 70 75 80

Leu Met Ser Gly Trp Leu Ser Thr Tyr Thr Trp Arg Cys Asp Pro Val
85 90 95

Asp Tyr Ser Asn Ser Pro Glu Ala Leu Arg Met Val Arg Val Ala Trp
100 105 110

Leu Phe Leu Phe Ser Lys Phe Ile Glu Leu Met Asp Thr Val Ile Phe
115 120 125

Ile Leu Arg Lys Lys Asp Gly Gln Val Thr Phe Leu His Val Phe His
130 135 140

His Ser Val Leu Pro Trp Ser Trp Trp Trp Gly Val Lys Ile Ala Pro
145 150 155 160

Gly Gly Met Gly Ser Phe His Ala Met Ile Asn Ser Ser Val His Val
165 170 175

Ile Met Tyr Leu Tyr Tyr Gly Leu Ser Ala Phe Gly Pro Val Ala Gln
180 185 190

Pro Tyr Leu Trp Trp Lys Lys His Met Thr Ala Ile Gln Leu Ile Gln
195 200 205

Phe Val Leu Val Ser Leu His Ile Ser Gln Tyr Tyr Phe Met Ser Ser
210 215 220

Cys Asn Tyr Gln Tyr Pro Val Ile Ile His Leu Ile Trp Met Tyr Gly
225 230 235 240

Thr Ile Phe Phe Met Leu Phe Ser Asn Phe Trp Tyr His Ser Tyr Thr
245 250 255

Lys Gly Lys Arg Leu Pro Arg Ala Leu Gln Gln Asn Gly Ala Pro Gly
260 265 270

Ile Ala Lys Val Lys Ala Asn
275



<210> 21

<211> 2393

<212> DNA

<213> Homo sapiens

<400> 21

ggccggcgcc tcctcctgga ttcattcact cgctcttttc attcacgaag gtagtgaggc
60 

ctagtggaaa gccatggaga gcgctctccc cgccgccggc ttcctgtact gggtcggcgc 
120 

gggcaccgtg gcctacctag ccctgcgtat ttcgtactcg ctcttcacgg ccctccgggt 
180 

ctggggagtg gggaatgagg cgggggtcgg cccggggctc ggagagtggg cagttgtcac 
50 240 

aggtagtact gatggaattg gaaaatcata tgcagaagag ttagcaaagc atggaatgaa 
300 

55 ggttgtcctt atcagcagat caaaggataa acttgaccag gtttccagtg aaataaaaga 
360 
aaaattcaaa gtggagacaa gaaccattgc tgttgacttt gcatcagaag atatttatga
420 

taaaattaaa acaggcttgg ctggtcttga aatcggcatc ttagtgaaca acgtgggaat 
5 480 

gtcgtatgag tatcctgaat actttttgga tgttcctgac ttggacaatg tgatcaagaa 
540 

10 aatgataaat attaatattc tttctgtttg taagatgaca caattggtac tgcctggcat 
600 



15 



30 



45 



ggtggaaaga tccaaagggg ctattctgaa catttcatct ggcagtggca tgctccctgt 
660 

cccactcttg accatctatt ctgcaaccaa gacttttgta gatttcttct ctcagtgcct 
720 



ccatgaggag tataggagca agggcgtctt tgtgcagagt gtcctgccat acttcgtagc 
20 780 

tacaaaactg gctaaaatcc ggaagccaac tttggataag ccctctccgg agacgtttgt 
840 

25 gaagtctgca attaaaacag tcggcctgca atcccgaacc aatggatacc tgatccatgc 
900 

tcttatgggc tcgataatct caaacctgcc ttcttggatt tatttgaaaa tagtcatgaa 
960 



tatgaacaag tctacacggg ctcactatct gaagaaaacc aagaagaact aagcatgata 
1020 



actgcattgt aacttggcca gatgctccag catatgcacg ttcactgcaa agcaccctac 
35 1080 

tggttttgaa aatctgacct tgtcatttca atagttatta acatgactaa atattatctt 
1140 

40 aattaagagg aaaatagaag ttgcttttag gggtttctga catatattct ggatactatc 
1200 

cgaggtaatt ttgaagttta atataaatgc tcatatcaaa tgaatataga actaatattg 
1260 



tcgggaacac ctaatagaaa ggaatactat tatagcaaat cacagaatga tagactcaag 
1320 



cataaaactt ggcagtttta tctgcttcaa aatgccattg atcattattc ctgtattttc 
50 1380 

tctgaaactg attataaaaa ccaatgtcca gctactcttt tgtttttgac acttgaagaa 
1440 

55 atggagatcg atttgatttg tttataagca gacacactgc aatttacaaa gatctcttta 
1500 
cggttttata aaattatctt ccagtttgta catttatatg gaattgttct ttatcaaggg
1560 

tagctaatga catgaaaata attgtgaaat atggaattat ttctgacaca tgaagcccac 
5 1620 

taaactatgc tttcctataa tgcatatttc ttctcagttt aaatgtatgt aaatatcgaa
1680



gctatatggt atgatttata aagataaatg ggccaaagtg tacattgaga ctggcagcca



1740 



tctatggtac cactgaaacc ctgacccaga aaagtggctt gcttggacac ccagctgcct 



1800 



ttgtttctgc attaaaccaa tattgatcac acatatgaca caggctagtc ctataaaagt 
1860 

aatgacttca tagaaatggc attataattt ttaagttgat actctacagg tagctattga 
20 1920 

tataattagt tttaataaaa catgctgcaa ccatggtata caacaaaaat acatttcttt 
1980 

25 ggtgattgaa attaaggccg tatttacaat gacttaatat aagactgact tttatcctgc 
2040 



ttcataactt gtatggagaa ctcaccaaga aagaattcaa tactgtgaaa tatgcagcaa 
2100 

gaagattggt ctttacctag gctgtgtttc ctaagctctg agttttcagc accagtagat 
2160 

ttgtattaaa agaaaaaaaa atggggcctt agcttctggc ttttaatttt gccagctaag 
35 2220 

gacataaaac aaaaataaac aaacaaaaac aaatagccat ctgctatcag catcattatg 



2280 

taaaagaaaa tatattttag cccctaaaat taggaagaat gtaatctcag aataaaggtt 
2340 

gtcatttaag ttgaataaat atatagcttt atgaaaaaca caaaaaaaaa aaa 
2393 



<210> 22 

<211> 939 

<212> DNA 

<213> Homo sapiens 



<400> 22 
atggagagcg ctctccccgc cgccggcttc ctgtactggg tcggcgcggg caccgtggcc
60

tacctagccc tgcgtatttc gtactcgctc ttcacggccc tccgggtctg gggagtgggg
120

aatgaggcgg gggtcggccc ggggctcgga gagtgggcag ttgtcacagg tagtactgat
180

ggaattggaa aatcatatgc agaagagtta gcaaagcatg gaatgaaggt tgtccttatc
240

agcagatcaa aggataaact tgaccaggtt tccagtgaaa taaaagaaaa attcaaagtg
300

gagacaagaa ccattgctgt tgactttgca tcagaagata tttatgataa aattaaaaca
360

ggcttggctg gtcttgaaat cggcatctta gtgaacaacg tgggaatgtc gtatgagtat
420

cctgaatact ttttggatgt tcctgacttg gacaatgtga tcaagaaaat gataaatatt
480

aatattcttt ctgtttgtaa gatgacacaa ttggtactgc ctggcatggt ggaaagatcc
540

aaaggggcta ttctgaacat ttcatctggc agtggcatgc tccctgtccc actcttgacc
600

atctattctg caaccaagac ttttgtagat ttcttctctc agtgcctcca tgaggagtat
660

aggagcaagg gcgtctttgt gcagagtgtc ctgccatact tcgtagctac aaaactggct
720

aaaatccgga agccaacttt ggataagccc tctccggaga cgtttgtgaa gtctgcaatt
780

aaaacagtcg gcctgcaatc ccgaaccaat ggatacctga tccatgctct tatgggctcg
840

ataatctcaa acctgccttc ttggatttat ttgaaaatag tcatgaatat gaacaagtct
900

acacgggctc actatctgaa gaaaaccaag aagaactaa
939

<210> 23

<211> 312

<212> PRT

<213> Homo sapiens

<400> 23

Met Glu Ser Ala Leu Pro Ala Ala Gly Phe Leu Tyr Trp Val Gly Ala
1 5 10 15

Gly Thr Val Ala Tyr Leu Ala Leu Arg Ile Ser Tyr Ser Leu Phe Thr
20 25 30

Ala Leu Arg Val Trp Gly Val Gly Asn Glu Ala Gly Val Gly Pro Gly
35 40 45

Leu Gly Glu Trp Ala Val Val Thr Gly Ser Thr Asp Gly Ile Gly Lys
50 55 60

Ser Tyr Ala Glu Glu Leu Ala Lys His Gly Met Lys Val Val Leu Ile
65 70 75 80

Ser Arg Ser Lys Asp Lys Leu Asp Gln Val Ser Ser Glu Ile Lys Glu
85 90 95

Lys Phe Lys Val Glu Thr Arg Thr Ile Ala Val Asp Phe Ala Ser Glu
100 105 110

Asp Ile Tyr Asp Lys Ile Lys Thr Gly Leu Ala Gly Leu Glu Ile Gly
115 120 125

Ile Leu Val Asn Asn Val Gly Met Ser Tyr Glu Tyr Pro Glu Tyr Phe
130 135 140

Leu Asp Val Pro Asp Leu Asp Asn Val Ile Lys Lys Met Ile Asn Ile
145 150 155 160

Asn Ile Leu Ser Val Cys Lys Met Thr Gln Leu Val Leu Pro Gly Met
165 170 175

Val Glu Arg Ser Lys Gly Ala Ile Leu Asn Ile Ser Ser Gly Ser Gly
180 185 190

Met Leu Pro Val Pro Leu Leu Thr Ile Tyr Ser Ala Thr Lys Thr Phe
195 200 205

Val Asp Phe Phe Ser Gln Cys Leu His Glu Glu Tyr Arg Ser Lys Gly
210 215 220

Val Phe Val Gln Ser Val Leu Pro Tyr Phe Val Ala Thr Lys Leu Ala
225 230 235 240

Lys Ile Arg Lys Pro Thr Leu Asp Lys Pro Ser Pro Glu Thr Phe Val
245 250 255

Lys Ser Ala Ile Lys Thr Val Gly Leu Gln Ser Arg Thr Asn Gly Tyr
260 265 270

Leu Ile His Ala Leu Met Gly Ser Ile Ile Ser Asn Leu Pro Ser Trp
275 280 285

Ile Tyr Leu Lys Ile Val Met Asn Met Asn Lys Ser Thr Arg Ala His
290 295 300

Tyr Leu Lys Lys Thr Lys Lys Asn
305 310



<210> 24

<211> 629

<212> DNA

<213> Homo sapiens

<400> 24

tgggagggag ccatgaagca ttacgaggtg gagattctgg acgcaaagac aagggagaag 
40 60 

ctgtgtttct tcgccgcgtg gatggcctat tacatcaatc accctctcta cactccccct 
120 

45 acctacggag ctcagcaggt gaaactggcg ctcgccatct ttgtgatctg ccagctcggc 
180 

aacttctcca tccacatggc cctgcgggac ctgcggcccg ctgggtccaa gacgcggaag 
240 

atcccatacc ccaccaagaa ccccttcacg tggctcttcc tgctggtgtc ctgccccaac 
300 

tgcacctacg aggtggggtc ctggatcggt ttcgccatca tgacgcagtg tctcccagtg 
55 360 
gccctgttct ccctggtggg cttcacccag atgaccatct gggccaaggg caagcaccgc
420 

agctacctga aggagttccg ggactacccg cccctgcgca tgcccatcat ccccttcctg 
5 480 

ctctgagcag ctcacccctg ctgaggctca gcccctcaac ccggtggcat tctgggggag 
540 

10 gagtggggcc cacagctctc cagcacccgg aataaagccc gcctgtcccc agtcggaaca 
600 



15 



20 



aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 
629 



<210> 25 
<211> 474 
<212> DNA 
<213> Homo sapiens 

25 

<400> 25 

atgaagcatt acgaggtgga gattctggac gcaaagacaa gggagaagct gtgtttcttc 
60 

gccgcgtgga tggcctatta catcaatcac cctctctaca ctccccctac ctacggagct 
120 

cagcaggtga aactggcgct cgccatcttt gtgatctgcc agctcggcaa cttctccatc 
35 180 

cacatggccc tgcgggacct gcggcccgct gggtccaaga cgcggaagat cccatacccc 
240 

40 accaagaacc ccttcacgtg gctcttcctg ctggtgtcct gccccaactg cacctacgag 
300 

gtggggtcct ggatcggttt cgccatcatg acgcagtgtc tcccagtggc cctgttctcc 
360 

ctggtgggct tcacccagat gaccatctgg gccaagggca agcaccgcag ctacctgaag 

420 



30 



45 



gagttccggg actacccgcc cctgcgcatg cccatcatcc ccttcctgct ctga 
50 474 



<210> 26

<211> 157

<212> PRT

<213> Homo sapiens

<400> 26

Met Lys His Tyr Glu Val Glu Ile Leu Asp Ala Lys Thr Arg Glu Lys
1 5 10 15

Leu Cys Phe Phe Ala Ala Trp Met Ala Tyr Tyr Ile Asn His Pro Leu
20 25 30

Tyr Thr Pro Pro Thr Tyr Gly Ala Gln Gln Val Lys Leu Ala Leu Ala
35 40 45

Ile Phe Val Ile Cys Gln Leu Gly Asn Phe Ser Ile His Met Ala Leu
50 55 60

Arg Asp Leu Arg Pro Ala Gly Ser Lys Thr Arg Lys Ile Pro Tyr Pro
65 70 75 80

Thr Lys Asn Pro Phe Thr Trp Leu Phe Leu Leu Val Ser Cys Pro Asn
85 90 95

Cys Thr Tyr Glu Val Gly Ser Trp Ile Gly Phe Ala Ile Met Thr Gln
100 105 110

Cys Leu Pro Val Ala Leu Phe Ser Leu Val Gly Phe Thr Gln Met Thr
115 120 125

Ile Trp Ala Lys Gly Lys His Arg Ser Tyr Leu Lys Glu Phe Arg Asp
130 135 140

Tyr Pro Pro Leu Arg Met Pro Ile Ile Pro Phe Leu Leu
145 150 155



<210> 27 

<211> 1115

<212> DNA 

<213> artificial sequence 
<220>

<223> RNAi fragment

<400> 27

tgccagtgct tcttggttgg agccggatac gtggctcttg cagctgtggc ttatcgtctt
60

ttgacgattt tctcgaatat tttgggccca tacgttcttc tgtcgccaat cgatttgaag
120

aaaagagctg gagcttcttg ggctggttcg tttctatatt tttactctgg tgatgtctta
180

tcgtatagca ctcagagaac tacttttggt tctctggaac tgttatcatt ttcaattgat
240

aataaaattg caatgaaaaa attgttgcgt aaatataaaa taattgtttt gaatttcaaa
300

caaaaaatac atttttcagt tgtcaccgga gccactgacg gaatcggaaa agcatacgcc
360

ttcgaattgg ctcgtcgtgg attcaatgtc ctgctcgttt cgcgtaccca atcaaaactc
420

gatgagacga agaaggagat tctcgagaag tattccagca ttgaggtccg cactgccgcc
480

ttcgacttca ccaacgctgc tccttctgct tacaaagatc ttctcgccac cttgaaccaa
540

gtagagatcg gagttcttat taacaacgtt ggaatgagct acgaatatcc agatgtactt
600

cacaaagttg acggtggaat cgagcgtctt gcaaacatca ccaccatcaa cactcttcca
660

ccaacattgg tgagttttat tagaattgta ctattgtgat cttactgaaa aagttcttaa
720

ctcagctctc cgccggaatc cttccacaaa tggtcgcacg aaaggctgga gtcattgtta
780

atgttggatc ttcagctgga gcaaatcaaa tggctctctg ggctgtgtat tcagctacaa
840

aggtttgaat ttcaaaatat cttccgtctt ttaattataa ttttattcca gaagtatgtc
900

tcctggctca ccgctatcct ccgaaaagaa tatgaacatc aaggaatcac tgtccaaact
960

attgctccaa tgatggtcgc cacaaagatg tcaaaagtca agagaacttc attcttcact
1020

ccagacggag ccgtgttcgc taaatcagct ctgaacactg ttggaaatac ctcagacacc
1080

accggataca tcacgcatca acttcaactc gagct
1115



<210> 28

<211> 20

<212> DNA

<213> artificial sequence

<220>

<223> primer

<400> 28

tgccagtgct tcttggttgg
20



<210> 29

<211> 22

<212> DNA

<213> artificial sequence

<220>

<223> primer

<400> 29

agctcgagtt gaagttgatg cg
22

45 

<210> 30 

<211> 951 

<212> DNA 

<213> Caenorhabditis elegans 



55 



<400> 30 
atggcttgcc agtgcttctt ggttggagcc ggatacgtgg ctcttgcagc tgtggcttat
60

cgtcttttga cgattttctc gaatattttg ggcccatacg ttcttctgtc gccaatcgat
120

ttgaagaaaa gagctggagc ttcttgggct gttgtcaccg gagccactga cggaatcgga
180

aaagcatacg ccttcgaatt ggctcgtcgt ggattcaatg tcctgctcgt ttcgcgtacc
240

caatcaaaac tcgatgagac gaagaaggag attctcgaga agtattccag cattgaggtc
300

cgcactgccg ccttcgactt caccaacgct gctccttctg cttacaaaga tcttctcgcc
360

accttgaacc aagtagagat cggagttctt attaacaacg ttggaatgag ctacgaatat
420

ccagatgtac ttcacaaagt tgacggtgga atcgagcgtc ttgcaaacat caccaccatc
480

aacactcttc caccaacatt gctctccgcc ggaatccttc cacaaatggt cgcacgaaag
540

gctggagtca ttgttaatgt tggatcttca gctggagcaa atcaaatggc tctctgggct
600

gtgtattcag ctacaaagaa gtatgtctcc tggctcaccg ctatcctccg aaaagaatat
660

gaacatcaag gaatcactgt ccaaactatt gctccaatga tggtcgccac aaagatgtca
720

aaagtcaaga gaacttcatt cttcactcca gacggagccg tgttcgctaa atcagctctg
780

aacactgttg gaaatacctc agacaccacc ggatacatca cgcatcaact tcaactcgag
840

ctcatggatc tcattccaac attcatccgc gacaagatcc tcacaaatat gagtgtcgga
900

actcgtgctg ctgctctcag aaagaaggaa agagaagcca aatctcagta a
951

<210> 31

<211> 3488

<212> DNA

<213> Homo sapiens

<400> 31

agcagctccg gcggccgaga cgggggcggc ggccgcgcgg gtctggcggg accggtttgg 
5 60 

aagactttgc cggcctgcag attggcctta agagaaggac ggagccacat actgctgacg 
120 

10 gcccagaact ggcagagaga aggttgccat ggctgctgtt gacagtttct acctcttgta 
180 

cagggaaatc gccaggtctt gcaattgcta tatggaagct ctagctttgg ttggagcctg 
240 

15 

gtatacggcc agaaaaagca tcactgtcat ctgtgacttt tacagcctga tcaggctgca 
300 

ttttatcccc cgcctgggga gcagagcaga cttgatcaag cagtatggaa gatgggccgt 
20 360 

tgtcagcggt gcaacagatg ggattggaaa agcctacgct gaagagttag caagccgagg 
420 

25 tctcaatata atcctgatta gtcggaacga ggagaagttg caggttgttg ctaaagacat 
480 

agccgacacg tacaaagtgg aaactgatat tatagttgcg gacttcagca gcggtcgtga 
540

30 

gatctacctt ccaattcgag aagccctgaa ggacaaagac gttggcatct tggtaaataa 
600 

cgtgggtgtg ttttatccct acccgcagta tttcactcag ctgtccgagg acaagctctg
35 660 

ggacatcata aatgtgaaca ttgccgccgc tagtttgatg gtccatgttg tgttaccggg 
720 

40 aatggtggag agaaagaaag gtgccatcgt cacgatctct tctggctcct gctgcaaacc 
780 

cactcctcag ctggctgcat tttctgcttc taaggcttat ttagaccact tcagcagagc
840 

45 

cttgcaatat gaatatgcct ctaaaggaat ctttgtacag agtctaatcc ctttctatgt 
900 

agccaccagc atgacagcac ccagcaactt tctgcacagg tgctcgtggt tggtgccttc 
50 960 

gccaaaagtc tatgcacatc atgctgtttc tactcttggg atttccaaaa ggaccacagg 
1020 

55 atattggtcc cattctattc agtttctttt tgcacagtat atgcctgaat ggctctgggt 
1080 



gtggggagca aatattctca accgttcact acgtaaggaa gccttatgct gcacagcctg
1140 

agtctggatg gccacttgag aagttttgcc aactcctggg aacctcgata ttctgacatt 
5 1200 

tggaaaaaca catttaattt atctcctgtg tttcattgct gattattcag catactgttg 
1260 

10 attcgtcatt tgcaaaacac acataatacc gtcagagtgc tgtgaaaaac cttaagggtg 
1320 

tgtggatggc acaggatcaa taatgcctga ggctgattga cgacatctac atttcagtgc 
1380 

15 

tttttcccta agctgtttga aagttacgct tttctgttgt tctagagcca cagcagtcta 
1440 

atattgaaat ataatatgat ttgtcaggtc ttataatttc agatgttgtt ttttaaggga 
20 1500 

aattgaccat ttcactagag gagttgtgct ggtttttaaa tgtgcatcaa gaaagactac 
1560 

25 tgaaaagtat tattttgtaa ctaagattgc tggtactatt aggaaaaatc tgtgtgtatc 
1620 

gtatagctct agctgtttga ctatctgtaa tgaaaatgct gcacttcaac tggtatttca 
1680 

30 

ttagagaacc gtgtgtgtgc gtgtgtgtgg tgcctttgag caactttatt tatggttacc 
1740 

atatttttaa aaagattttt tgtcaggatg acttaacatg gactcttata gggtattaaa 
1800

acaatctaga ttattccttt tcatcctaaa taagcctacc aaatttcatg ctgttggttt 
1860 

40 gccatgaatg atattacttc ctacattata tttgtgtttt ttcaaatctg ctatggaatg 
1920 

aacttattcc tagatttgga tatgtaagag aaacctgcag tcatcttttg atttataagg 
1980 

45 

caattcttgt ggataaatag tgatttctca gcctctgacc cattttataa ctgaaattta 
2040 

gccctttaga gcttgttata tctggttttc ctacgttttt ctatgtaata ttattccatt 
50 2100 

ccagtagcat tattgataga aatagtaagt atttatggaa tagtaaaata tggacaaatt 
2160 

55 acgtgtgtga catatctgtc aaaataagtt agaagcttat tcttggtttg tgtaatgaat 
2220 
ttatgtattg tagtgaatac ctttactggt gtgaagataa ttatgcacaa accctcacaa
2280 

tacgcgttaa cattgaaacc tgtgaaatgt ccttaggttg ggtcatataa agccaaccat 
5 2340 

ttttgaggac catgtaccta gtgctttgaa aactgtaagt cactatatga atatgacaat 
2400 

10 atgtgcacat ttaaaattca gagctcggca ttgtgatact gatgcagaag ctagtagatt 
2460 

ggttaaaagt ctggacttct gtggcatttt tttcgtgacg tgataatcta tcataagcag 
2520 

15 

acctaagcac agttttatga acacaatttt gcccatgaca ttgcctacag gatttccaga 
2580 

tgtgacttgc actcagaaga tcagtggtca acttcagaag ctcttccacg cttagatcat 
20 2640 

gtcttcagaa cttagatgtg aaaatctaca cactgggaga tgctgtgagc cccaaggttt 
2700 

25 tgatggagtt tgcttggaat cctcttgact tcatgccaca ttgacgtgaa ctttgatgta 
2760 

taataagcag cagcaacttc atgtgaaaat atggtcaggt agttatatgt aaggttacgt 
2820 

30 

ggtccagtaa tgtcttagat tgataaatta ggtatggaat ccatcagtgt tacgtgatga 
2880 

gaataggtga acacaccttg tcagtgatga tgtaaacttc tctccttggc agggcatggg 
35 2940 

caaacatgct gattggtgca aatgtggtgc cgagctgtcc atagctgcag tgaaaggtga 
3000 

40 agagcaagac cttctctagg ttttctagct ttcattaaat gtattttttt ccccagagct 
3060 

aatttgaaag ttgattggac cactgtggat ggggtctcat taagaatgtg ggaaataggg 
3120 


gccgagtgcg gtggctcaca cctgtaatcc cagcagtttg gaaggccagg gcaggtggat 
3180 

cgcttgatcc caggaggtcg agaccagcct ggggaacaca tcctgtctct acaaaaaata 
50 3240 

caaaaattag ccaggcaggg tggtgcatgc ctgtagtccc agctacttgg gaggctgagg 
3300 

55 caggagaatt ttttgagccc aggatgcaga ggttgaagtg agccaagatc gtgccactgc 
3360 
actccagcct tgagacagag cgagaccctg tctcaaaaaa aaaaaaagaa cgtgggaaat
3420 

atgaaccttt gaaagttaat ctgtgaattg aaagtttaac aataaaagta gttgtttgtt 
5 3480 

tcctttgg 
3488 

10 

<210> 32 

<211> 993 

15 <212> DNA 

<213> Homo sapiens 



20 

<400> 32 

atggctgctg ttgacagttt ctacctcttg tacagggaaa tcgccaggtc ttgcaattgc 
60 

25 tatatggaag ctctagcttt ggttggagcc tggtatacgg ccagaaaaag catcactgtc 
120 

atctgtgact tttacagcct gatcaggctg cattttatcc cccgcctggg gagcagagca 
180 



30 



45 



gacttgatca agcagtatgg aagatgggcc gttgtcagcg gtgcaacaga tgggattgga 
240 



aaagcctacg ctgaagagtt agcaagccga ggtctcaata taatcctgat tagtcggaac 
300 

gaggagaagt tgcaggttgt tgctaaagac atagccgaca cgtacaaagt ggaaactgat 
360 

attatagttg cggacttcag cagcggtcgt gagatctacc ttccaattcg agaagccctg 
420 



aaggacaaag acgttggcat cttggtaaat aacgtgggtg tgttttatcc ctacccgcag 
480 

tatttcactc agctgtccga ggacaagctc tgggacatca taaatgtgaa cattgccgcc 
540 



gctagtttga tggtccatgt tgtgttaccg ggaatggtgg agagaaagaa aggtgccatc 
600 

gtcacgatct cttctggctc ctgctgcaaa cccactcctc agctggctgc attttctgct 
660 

tctaaggctt atttagacca cttcagcaga gccttgcaat atgaatatgc ctctaaagga 
720 

atctttgtac agagtctaat ccctttctat gtagccacca gcatgacagc acccagcaac 
780 

tttctgcaca ggtgctcgtg gttggtgcct tcgccaaaag tctatgcaca tcatgctgtt 
840 

tctactcttg ggatttccaa aaggaccaca ggatattggt cccattctat tcagtttctt 
900 

tttgcacagt atatgcctga atggctctgg gtgtggggag caaatattct caaccgttca 
960 

ctacgtaagg aagccttatg ctgcacagcc tga 
993 



<210> 33 

<211> 330 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Met Ala Ala Val Asp Ser Phe Tyr Leu Leu Tyr Arg Glu Ile Ala Arg 
1 5 10 15 



Ser Cys Asn Cys Tyr Met Glu Ala Leu Ala Leu Val Gly Ala Trp Tyr 
20 25 30 



Thr Ala Arg Lys Ser Ile Thr Val Ile Cys Asp Phe Tyr Ser Leu Ile 
35 40 45 



Arg Leu His Phe Ile Pro Arg Leu Gly Ser Arg Ala Asp Leu Ile Lys 
50 55 60 



Gln Tyr Gly Arg Trp Ala Val Val Ser Gly Ala Thr Asp Gly Ile Gly 
65 70 75 80 



Lys Ala Tyr Ala Glu Glu Leu Ala Ser Arg Gly Leu Asn Ile Ile Leu 
85 90 95 



Ile Ser Arg Asn Glu Glu Lys Leu Gln Val Val Ala Lys Asp Ile Ala 
100 105 110 



Asp Thr Tyr Lys Val Glu Thr Asp Ile Ile Val Ala Asp Phe Ser Ser 
115 120 125 



Gly Arg Glu Ile Tyr Leu Pro Ile Arg Glu Ala Leu Lys Asp Lys Asp 
130 135 140 



Val Gly Ile Leu Val Asn Asn Val Gly Val Phe Tyr Pro Tyr Pro Gln 
145 150 155 160 



Tyr Phe Thr Gln Leu Ser Glu Asp Lys Leu Trp Asp Ile Ile Asn Val 
165 170 175 



Asn Ile Ala Ala Ala Ser Leu Met Val His Val Val Leu Pro Gly Met 
180 185 190 



Val Glu Arg Lys Lys Gly Ala Ile Val Thr Ile Ser Ser Gly Ser Cys 
195 200 205 



Cys Lys Pro Thr Pro Gln Leu Ala Ala Phe Ser Ala Ser Lys Ala Tyr 
210 215 220 



Leu Asp His Phe Ser Arg Ala Leu Gln Tyr Glu Tyr Ala Ser Lys Gly 
225 230 235 240 



Ile Phe Val Gln Ser Leu Ile Pro Phe Tyr Val Ala Thr Ser Met Thr 
245 250 255 



Ala Pro Ser Asn Phe Leu His Arg Cys Ser Trp Leu Val Pro Ser Pro 
260 265 270 



Lys Val Tyr Ala His His Ala Val Ser Thr Leu Gly Ile Ser Lys Arg 
275 280 285 



Thr Thr Gly Tyr Trp Ser His Ser Ile Gln Phe Leu Phe Ala Gln Tyr 
290 295 300 



Met Pro Glu Trp Leu Trp Val Trp Gly Ala Asn Ile Leu Asn Arg Ser 
305 310 315 320 



Leu Arg Lys Glu Ala Leu Cys Cys Thr Ala 
325 330 



1. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of SEQ 
ID n° 3 (human ELOVL6) or its mouse orthologue LCE. 



2. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 5 (further putative human enzyme involved in 
fatty acid elongation) 



3. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 8 (human ELOVL3) or its mouse orthologue Cig30. 



4. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 11 (human ELOVL5) 



5. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 14 (human ELOVL4) 

6. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 17 (human ELOVL2) or its mouse orthologue Ssc2. 



7. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 20 (human ELOVL1) or its mouse orthologue Ssc1. 



8. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 23 (human KAR) 



9. claims: 1-19 (partially) 

Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 26 (human TER) 



Method for identifying a compound useful for prevention or 
treatment of a metabolic disorder comprising : contacting a 
test chemical with an amino acid sequence involved in the 
elongation of fatty acids, or a host cell or animal 
expressing such a sequence, and detecting interaction 
between both; wherein the amino acid sequence is that of 
SEQ ID n° 33 (homologue of KAR) 